AI Demystified
🗺️ Learning Roadmap
Track 0: Grounding
Get oriented — understand the AI landscape before diving in
Parts 0–0 · 1 published
Track 1: Foundation
How AI works — the pipeline, concepts, and limitations
Parts 1–8 · 8 published
Track 2: Practice
How to use AI — prompt engineering, embeddings, RAG, and fine-tuning
Parts 9–15 · 7 published
Track 3: Hands-On
How to build with AI — coding setup, tool calling, and hands-on projects
Parts 16–99 · 0 published
- Track 0: Grounding
- 0 →
5 Mental Models You Need Before Diving Into AI
Before you learn how AI works, learn how to think about it. These five mental models will save you hours of confusion.
- Track 1: Foundation
- 1 →
What Happens When You Ask AI Something?
You type a message. Half a second later, AI replies. Here's every single step that happens in between — from your first letter to the model's last word.
- 2 →
Transformers — The Architecture That Changed Everything
Every major AI — GPT, Claude, Gemini, Llama — runs on the Transformer. Here's how it works, stage by stage, with live visualizations.
- 3 →
How AI Learns, Thinks, and Decides
Training, inference, sampling, fine-tuning — these words are everywhere. Here's what they actually mean, with live visualizations and honest analogies.
- 4 →
How AI Reads Your Words
AI doesn't read letters or words — it reads tokens. This small detail explains why AI costs money, why it stumbles on names, and why non-English is more expensive.
- 5 →
Why AI Forgets
Mid-conversation, AI suddenly doesn't remember what you said earlier. This isn't a bug — it's the context window. Here's how it works and how to work around it.
- 6 →
Why AI Lies (And Doesn't Know It)
AI sometimes states false things with complete confidence — fake citations, wrong numbers, invented APIs. This is hallucination. Here's why it happens and how to catch it.
- 7 →
What AI Cannot Do
8 structural limitations of every language model — not bugs, not temporary — built into how they work. Illustrated.
- 8 →
How AI Reasons (And Why It Sometimes Breaks)
o1, o3, DeepSeek-R1 — reasoning models behave differently. What chain-of-thought actually is, what 'thinking longer' means, and where it still fails.
- Track 2: Practice
- 9 →
Prompt Engineering — How to Talk to AI
Five techniques that cover 95% of real-world prompt engineering. Zero-shot, few-shot, role prompting, chain-of-thought, and structured output — with before/after examples.
- 10 →
Embeddings & Vector Databases — The Memory Layer of AI
How neural networks encode meaning as numbers, why similar things end up close in vector space, and what databases like Pinecone and pgvector actually do.
- 11 →
RAG Explained — How AI Knows What You Didn't Train It On
Retrieval-Augmented Generation lets LLMs answer questions about documents they've never seen. Here's how the pipeline works and when to use it.
- 12 →
Fine-tuning vs. Prompting — When to Use Which
Prompt engineering gets you 80% of the way. Fine-tuning gets you the last 20%. Here's a clear decision framework for choosing between them.
- 13 →
Do You Really Need GPT-4?
Frontier models are impressive — and often the wrong choice. Here's the model selection map: cost, capability, latency, and when local wins.
- 14 →
Latency, Tokens, and Cost — The Physics of AI Products
Why is AI slow? Why does it cost money? What does streaming actually change? The mechanics of inference, visualized.
- 15 →
How Do You Know AI Is Actually Working?
Demos always look good. Production AI degrades silently. Here's the evaluation framework — from exact match to human review — and how to catch hallucinations.
- Track 3: Hands-On
- 16 coming soon
Coding Setup — Your AI Development Environment
Set up Python, run real AI code, and implement tokenization, embeddings, RAG, and temperature sampling — all from scratch.
- 17 coming soon
MCP Tool Calling — How AI Uses Tools
Model Context Protocol lets LLMs interact with the outside world through tools. Here's how the protocol works and what it enables.
- 18 coming soon
AI Agents — Beyond Chatbots
Agents plan, act, observe, and retry. Here's the planner-executor loop animated — with a research task, a coding task, and what a real failure looks like.
- 19 coming soon
Build Your First Real AI App
From blank repo to working RAG system — implementing embeddings, vector search, streaming, structured output, and an eval suite in Python.
- 20 coming soon
Token Optimization — Spend Less, Get More
Every AI API call is billed in tokens. Here's a practical toolkit for cutting cost and latency without cutting quality.