Learning Path

Generative AI Learning Path

Your complete pre-reading hub. Seventeen deep-dive study guides covering everything from transformer internals to deploying autonomous multi-agent systems in production — designed for learners who want to go from zero to building production GenAI systems.

Everything You Need Before Day One

A practitioner's checklist for getting started with GenAI development — the Python patterns, API skills, terminal comfort, and tooling you need before writing your first LLM application. No math required.

11 min readNotebook

Python IntermediateREST APIsGit & CLIJSON & PydanticEnvironment SetupAsync Basics

How Modern AI Actually Works

A practitioner's mental model of how LLMs work — what transformers do, why tokenization affects your costs, how embeddings power search, and what controls your model's behavior. No math required.

27 min readNotebook

Transformer ArchitectureTokenizationEmbeddingsPre-training & RLHFInference Mechanics

LLMs, SLMs & Multimodal Models

A practitioner's guide to choosing the right model — when to use large vs small, closed vs open, text-only vs multimodal. Includes cost comparisons, capability tradeoffs, and a decision framework for production model selection.

13 min readNotebook

Model LandscapeKey Models & FamiliesSmall Language ModelsMultimodal CapabilitiesSelection Framework

API for Accessing Large Language Models

A practitioner's guide to calling LLM APIs in production — the chat completions interface, provider SDK patterns, streaming, cost management, multi-provider routing with LiteLLM, and structured outputs with Pydantic.

13 min readNotebook

Chat CompletionsProvider SDKsStreamingCost ManagementLiteLLM RoutingStructured Outputs

Teaching Models New Behaviors

A practitioner's guide to fine-tuning LLMs — when to fine-tune vs RAG vs prompt, how LoRA and QLoRA work without the math, data preparation checklists, cost comparisons, and the OpenAI and HuggingFace workflows.

13 min readNotebook

LoRA & QLoRARLHF & DPODataset PrepOpenAI Fine-Tuning APIHuggingFace PEFT & TRL

LLM Hosting & API Exposure

A practitioner's guide to self-hosting LLMs — when to self-host vs use APIs, how to choose between vLLM, TGI, and Ollama, GPU sizing, cost comparisons, and production deployment patterns with FastAPI, SageMaker, and Docker.

13 min readNotebook

vLLM & TGIOllamaFastAPI WrappersSageMaker EndpointsDocker & ECSGPU Optimization

Talking to Models Precisely

A practitioner's guide to production prompt engineering — system prompt design patterns, few-shot template libraries, chain-of-thought techniques, structured output extraction, and prompt security defenses. Patterns that work at scale, not theory.

11 min readNotebook

System PromptsFew-Shot & Many-ShotChain-of-ThoughtReAct & AgentsStructured OutputPrompt Security

Retrieval-Augmented Generation

A practitioner's guide to building RAG systems that work in production — when to use RAG, chunking strategy comparison, embedding model selection, vector database tradeoffs, retrieval quality tuning, and a production-readiness checklist.

11 min readNotebook

Document LoadingChunking StrategiesEmbeddingsVector DatabasesRetrieval & RerankingLLM Synthesis

Advanced RAG & Multimodal

A practitioner's guide to upgrading basic RAG — hybrid search, reranking, query transformation, parent-child chunks, self-corrective loops, and when each technique is worth the added complexity. Decision frameworks, not algorithms.

10 min readNotebook

Query TransformationParent-Child ChunksGraph RAGMultimodal RAGSelf-Corrective RAGProduction Patterns

Agents & Multi-Agent Systems

A practitioner's guide to LLM agents — ReAct patterns, tool definition best practices, when to use agents vs. chains, memory strategies, multi-agent architectures, human-in-the-loop gates, and production guardrails. Patterns that ship, not theory.

11 min readNotebook

ReAct PatternTool Use / Function CallingLangGraph WorkflowsMulti-Agent SystemsHuman-in-the-LoopAgent Safety

Evaluation Strategies for LLM Systems

A practitioner's guide to evaluating LLM applications — what to measure, LLM-as-judge patterns, RAGAS for RAG pipelines, regression testing, A/B testing, evaluation pipeline design, and when to invest in human eval. Decision frameworks for building eval infrastructure that earns trust.

12 min readNotebook

LLM-as-JudgeRAGAS FrameworkBenchmarks & MetricsEval-Driven DevHuman Evaluation

LLM Guardrails

A practitioner's guide to building production guardrail systems — input validation, output filtering, prompt injection defense, PII detection, and framework selection. Defense in depth for every LLM application.

14 min readNotebook

Input ValidationOutput GuardrailsPrompt Injection DefensePII DetectionContent SafetyGuardrails AI Framework

Model Context Protocol: Universal Tool Integration

A practitioner's guide to MCP — what it solves, when to use it vs. direct tool calling, how to build servers with FastMCP, security considerations, and practical setup for Claude Desktop and agent workflows.

15 min readNotebook

MCP ArchitectureTools, Resources & PromptsFastMCP in PythonAgent WorkflowsReal-World ServersSecurity Model

AWS Cloud Services for GenAI

A practitioner's guide to deploying GenAI on AWS — Bedrock vs. SageMaker decision framework, serverless architectures, data service selection, security patterns, and cost optimization strategies.

11 min readNotebook

Amazon BedrockSageMakerLambda + API GatewayS3 & DynamoDBIAM & SecurityCost Optimization

No-Code Agents with n8n

A practitioner's guide to building AI workflows without code — when no-code makes sense, n8n workflow patterns for LLMs, connecting AI to business tools, limitations vs. custom code, and production deployment.

13 min readNotebook

n8n ArchitectureAI Agent NodesVisual RAG PipelinesWebhook Triggers400+ IntegrationsSelf-Hosted Deploy

Capstone I: Document Portal

Build a production-grade document Q&A portal from scratch — document ingestion, chunking, embeddings, vector storage, RAG retrieval, FastAPI backend, and chat frontend. A practitioner's end-to-end project.

12 min readNotebook

Document IngestionVector StoreRAG PipelineFastAPI BackendChat InterfaceDeployment

Capstone II: Autonomous Report Agent

Build an autonomous multi-agent system that researches a topic, gathers data from multiple sources, synthesizes findings, and produces a structured report — a practitioner's end-to-end agentic project.

11 min readNotebook

LangGraph WorkflowsMulti-Agent DesignTool IntegrationGuardrailsHuman-in-the-LoopReport Generation

Explore Use Cases