📋 Full Curriculum
All 30 days at a glance. Status: Ready = content complete,
Stub = structure ready, needs research expansion.
| Day | Title | Topic | Status |
| Day 1 |
ML System Blueprint |
The three pillars: compute, storage, orchestration |
Ready
|
| Day 2 |
Sync vs Async Inference |
Request-response, batch, streaming — tradeoffs in latency, throughput, cost |
Ready
|
| Day 3 |
Caching Strategies |
Semantic caching, KV-cache reuse, prompt caching |
Ready
|
| Day 4 |
Load Balancing & Routing |
Round-robin, least-connections, semantic routers |
Ready
|
| Day 5 |
Stateless vs Stateful Inference |
KV cache, conversation history, managing state at scale |
Ready
|
| Day 6 |
Rate Limiting & Quotas |
Token bucket, sliding window, user-tier enforcement |
Ready
|
| Day 7 |
Mini-Project — AI Gateway |
Containerise your proxy, cache, and rate limiter into a unified gateway |
Ready
|
| Day | Title | Topic | Status |
| Day 8 |
Data Pipelines |
Extract, transform, embed, store — batch vs streaming |
Stub
|
| Day 9 |
Vector Databases |
Index types (IVF, HNSW), tradeoffs, hybrid search |
Stub
|
| Day 10 |
RAG Architecture |
Ingestion pipeline, retriever, reranker, generator |
Stub
|
| Day 11 |
Distributed Training 101 |
Data parallelism, model parallelism, pipeline parallelism |
Stub
|
| Day 12 |
Checkpointing & Fault Tolerance |
Save/restore training state, preemption handling, spot instances |
Stub
|
| Day 13 |
Experiment Tracking |
Structured logging, hyperparameter sweeps, model registry |
Stub
|
| Day 14 |
Mini-Project — End-to-End RAG |
Integrate pipeline + vector DB + LLM into a complete RAG system |
Stub
|
| Day | Title | Topic | Status |
| Day 15 |
Inference Optimization |
Quantization (GGUF, GPTQ, AWQ), throughput, quality tradeoffs |
Stub
|
| Day 16 |
Continuous Batching & Speculative Decoding |
How vLLM/TGI achieve high throughput |
Stub
|
| Day 17 |
Prefill vs Decode |
The two phases of transformer inference |
Stub
|
| Day 18 |
GPU vs CPU Offloading |
Layer placement, PCIe bottlenecks, memory hierarchy |
Stub
|
| Day 19 |
Streaming & SSE |
Token streaming fan-out, head-of-line blocking prevention |
Stub
|
| Day 20 |
Model Adapters & LoRA |
Adapter swapping, multi-task serving, parameter-efficient fine-tuning |
Stub
|
| Day 21 |
Mini-Project — Inference Benchmark Suite |
Script that sweeps parameters and produces a comparison table |
Stub
|
| Day | Title | Topic | Status |
| Day 22 |
Observability |
Metrics, traces, logs — the three pillars for AI systems |
Stub
|
| Day 23 |
Guardrails & Safety |
Input/output filtering, PII detection, prompt injection defense |
Stub
|
| Day 24 |
A/B Testing & Canary Deployments |
Shadow traffic, gradual rollout, automated rollback |
Stub
|
| Day 25 |
Case Study: ChatGPT |
The infrastructure behind a global chat product |
Stub
|
| Day 26 |
Case Study: Perplexity |
Real-time web search + RAG at scale |
Stub
|
| Day 27 |
Case Study: GitHub Copilot |
Context window management, code-specific embeddings, fast completion |
Stub
|
| Day 28 |
Cost Engineering |
Token economics, cache hit rates, model selection by task difficulty |
Stub
|
| Day 29 |
Scaling Law Intuition |
How data/compute affects system cost — hardware vs optimisation |
Stub
|
| Day 30 |
Final Project: Design a Production AI System |
End-to-end architecture design, from concept to deployment |
Stub
|