Agent Engineering
9 interactive modules. Production-ready agent engineering that picks up where AI Agents Foundations ends. Free, browser-based.
Idempotency, checkpoints, retry policy, durable execution platforms — surviving crashes, deploys, and network failures.
Span-per-tick traces, what to log, replay, golden metrics vs vanity metrics, alerting on stochastic systems.
Defense-in-depth: input filters, output filters, policy enforcement, fail-safe vs fail-open.
Where the tokens go, prompt caching, result caching, parallel tool calls, batching at the agent layer.
Online vs offline evals, shadow traffic, A/B harness, drift detection, eval-driven rollout.
Prompts as code, canary rollouts, rolling releases, version pinning, rollback discipline.
The first 15 minutes, trace replay in anger, root-cause discipline, postmortems, drills.
Multi-agent orchestration — when teams beat a single agent, supervisor/worker, voting, handoffs, coordination cost.
SLOs, error budgets, on-call rotations, runbooks. From Foundations to running an agent fleet.