Front Matter

Preface

This book is a field-first mastery text on modern ML systems — a holy-grail reference for interview preparation and deep production work. It covers ML foundations, training and fine-tuning, inference internals, production serving, retrieval and RAG, agents, distributed systems, data plane, observability, build/deploy/operate, and a complete ML system design interview playbook.

The ML and inference track (Parts I–III) ramps Beginner → Expert → PhD → Production. Each layer is built on the one beneath it, so the production choices read as the natural answer to the research-level questions. The systems and MLOps tracks (Parts IV–IX) assume a strong generalist baseline and go deep on tradeoffs, design rationale, and "why this and not that." Part X is the capstone interview playbook.

Every chapter ends with Read it yourself (curated references) and Practice (exercises including a stretch problem). Use them.

Reading paths

The mastery path (slowest, deepest)

Front to back, every chapter. The "I want to be the best" path.

The interview-prep path (focused)

56 chapters that cover what interviewers actually test. Start here if you have 2–4 weeks.

Part III — Inference
Part X — Interview Playbook

Don't skip: Appendix E — 530+ interview questions organized by topic, difficulty, and role.

Conventions

  • Code snippets are in Python or pseudocode unless the topic is language-specific.
  • Code examples are real, lifted from production patterns and adapted — names stripped, identifying paths removed.
  • Diagrams use inline SVG and Mermaid — they adapt to light/dark mode automatically.