Learning Path

Generative AI Learning Path

Your complete pre-reading hub. Seventeen deep-dive study guides covering everything from transformer internals to deploying autonomous multi-agent systems in production — designed for learners who want to go from zero to building production GenAI systems.

00

Everything You Need Before Day One

A practitioner's checklist for getting started with GenAI development — the Python patterns, API skills, terminal comfort, and tooling you need before writing your first LLM application. No math required.

11 min readNotebook
Python IntermediateREST APIsGit & CLIJSON & PydanticEnvironment SetupAsync Basics
01

How Modern AI Actually Works

A practitioner's mental model of how LLMs work — what transformers do, why tokenization affects your costs, how embeddings power search, and what controls your model's behavior. No math required.

27 min readNotebook
Transformer ArchitectureTokenizationEmbeddingsPre-training & RLHFInference Mechanics
02

LLMs, SLMs & Multimodal Models

A practitioner's guide to choosing the right model — when to use large vs small, closed vs open, text-only vs multimodal. Includes cost comparisons, capability tradeoffs, and a decision framework for production model selection.

13 min readNotebook
Model LandscapeKey Models & FamiliesSmall Language ModelsMultimodal CapabilitiesSelection Framework
03

API for Accessing Large Language Models

A practitioner's guide to calling LLM APIs in production — the chat completions interface, provider SDK patterns, streaming, cost management, multi-provider routing with LiteLLM, and structured outputs with Pydantic.

13 min readNotebook
Chat CompletionsProvider SDKsStreamingCost ManagementLiteLLM RoutingStructured Outputs
04

Teaching Models New Behaviors

A practitioner's guide to fine-tuning LLMs — when to fine-tune vs RAG vs prompt, how LoRA and QLoRA work without the math, data preparation checklists, cost comparisons, and the OpenAI and HuggingFace workflows.

13 min readNotebook
LoRA & QLoRARLHF & DPODataset PrepOpenAI Fine-Tuning APIHuggingFace PEFT & TRL
05

LLM Hosting & API Exposure

A practitioner's guide to self-hosting LLMs — when to self-host vs use APIs, how to choose between vLLM, TGI, and Ollama, GPU sizing, cost comparisons, and production deployment patterns with FastAPI, SageMaker, and Docker.

13 min readNotebook
vLLM & TGIOllamaFastAPI WrappersSageMaker EndpointsDocker & ECSGPU Optimization
06

Talking to Models Precisely

A practitioner's guide to production prompt engineering — system prompt design patterns, few-shot template libraries, chain-of-thought techniques, structured output extraction, and prompt security defenses. Patterns that work at scale, not theory.

11 min readNotebook
System PromptsFew-Shot & Many-ShotChain-of-ThoughtReAct & AgentsStructured OutputPrompt Security
07

Retrieval-Augmented Generation

A practitioner's guide to building RAG systems that work in production — when to use RAG, chunking strategy comparison, embedding model selection, vector database tradeoffs, retrieval quality tuning, and a production-readiness checklist.

11 min readNotebook
Document LoadingChunking StrategiesEmbeddingsVector DatabasesRetrieval & RerankingLLM Synthesis
08

Advanced RAG & Multimodal

A practitioner's guide to upgrading basic RAG — hybrid search, reranking, query transformation, parent-child chunks, self-corrective loops, and when each technique is worth the added complexity. Decision frameworks, not algorithms.

10 min readNotebook
Query TransformationParent-Child ChunksGraph RAGMultimodal RAGSelf-Corrective RAGProduction Patterns
09

Agents & Multi-Agent Systems

A practitioner's guide to LLM agents — ReAct patterns, tool definition best practices, when to use agents vs. chains, memory strategies, multi-agent architectures, human-in-the-loop gates, and production guardrails. Patterns that ship, not theory.

11 min readNotebook
ReAct PatternTool Use / Function CallingLangGraph WorkflowsMulti-Agent SystemsHuman-in-the-LoopAgent Safety
10

Evaluation Strategies for LLM Systems

A practitioner's guide to evaluating LLM applications — what to measure, LLM-as-judge patterns, RAGAS for RAG pipelines, regression testing, A/B testing, evaluation pipeline design, and when to invest in human eval. Decision frameworks for building eval infrastructure that earns trust.

12 min readNotebook
LLM-as-JudgeRAGAS FrameworkBenchmarks & MetricsEval-Driven DevHuman Evaluation
11

LLM Guardrails

A practitioner's guide to building production guardrail systems — input validation, output filtering, prompt injection defense, PII detection, and framework selection. Defense in depth for every LLM application.

14 min readNotebook
Input ValidationOutput GuardrailsPrompt Injection DefensePII DetectionContent SafetyGuardrails AI Framework
12

Model Context Protocol: Universal Tool Integration

A practitioner's guide to MCP — what it solves, when to use it vs. direct tool calling, how to build servers with FastMCP, security considerations, and practical setup for Claude Desktop and agent workflows.

15 min readNotebook
MCP ArchitectureTools, Resources & PromptsFastMCP in PythonAgent WorkflowsReal-World ServersSecurity Model
13

AWS Cloud Services for GenAI

A practitioner's guide to deploying GenAI on AWS — Bedrock vs. SageMaker decision framework, serverless architectures, data service selection, security patterns, and cost optimization strategies.

11 min readNotebook
Amazon BedrockSageMakerLambda + API GatewayS3 & DynamoDBIAM & SecurityCost Optimization
14

No-Code Agents with n8n

A practitioner's guide to building AI workflows without code — when no-code makes sense, n8n workflow patterns for LLMs, connecting AI to business tools, limitations vs. custom code, and production deployment.

13 min readNotebook
n8n ArchitectureAI Agent NodesVisual RAG PipelinesWebhook Triggers400+ IntegrationsSelf-Hosted Deploy
15

Capstone I: Document Portal

Build a production-grade document Q&A portal from scratch — document ingestion, chunking, embeddings, vector storage, RAG retrieval, FastAPI backend, and chat frontend. A practitioner's end-to-end project.

12 min readNotebook
Document IngestionVector StoreRAG PipelineFastAPI BackendChat InterfaceDeployment
16

Capstone II: Autonomous Report Agent

Build an autonomous multi-agent system that researches a topic, gathers data from multiple sources, synthesizes findings, and produces a structured report — a practitioner's end-to-end agentic project.

11 min readNotebook
LangGraph WorkflowsMulti-Agent DesignTool IntegrationGuardrailsHuman-in-the-LoopReport Generation