Evals, Observability & Safety
The #1 senior differentiator. Anyone can demo an agent; seniors can prove it works, see why it fails, and stop it from doing damage. Eval harnesses, tracing, cost dashboards, prompt-injection defense in depth, human-in-the-loop gates, and honest postmortems.
- Design an eval pyramid (deterministic assertions, validated LLM-as-judge, sampled human review) for a real agent
- Validate an LLM judge against human labels and report agreement before trusting it
- Build a regression suite that turns every fixed bug into a CI test case
- Instrument an agent with tracing so every LLM and tool call carries tokens, cost, and latency
- Diagnose a cost regression and a quality regression from traces alone
- Reason about prompt injection with the lethal-trifecta lens and layer real defenses
- Add a human-in-the-loop approval gate to any irreversible action, with an audit log
- Write a blameless postmortem: timeline, root cause, detection gap, fix, regression test
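
As a concrete taste of the "validate an LLM judge" objective, here is a minimal sketch of an agreement report. It assumes you already have two parallel lists of labels for the same examples, one from human reviewers and one from the judge. The function names (`cohens_kappa`, `agreement_report`) and the sample labels are hypothetical illustrations, not part of the module's labs. Cohen's kappa is one common choice of statistic because it corrects raw agreement for the agreement you would expect by chance; other measures may suit your data better.

```python
from collections import Counter


def cohens_kappa(human, judge):
    """Chance-corrected agreement between two label lists of equal length."""
    if len(human) != len(judge) or not human:
        raise ValueError("label lists must be non-empty and the same length")
    n = len(human)
    # Observed agreement: fraction of examples where both raters match.
    observed = sum(h == j for h, j in zip(human, judge)) / n
    # Expected agreement if each rater labelled independently at their own base rates.
    human_counts = Counter(human)
    judge_counts = Counter(judge)
    expected = sum(human_counts[label] * judge_counts[label] for label in human_counts) / (n * n)
    if expected == 1.0:
        # Both raters used a single identical label; agreement is trivially perfect.
        return 1.0
    return (observed - expected) / (1 - expected)


def agreement_report(human, judge):
    """Summarise judge-vs-human agreement (hypothetical report shape)."""
    n = len(human)
    raw = sum(h == j for h, j in zip(human, judge)) / n
    return {"n": n, "raw_agreement": raw, "kappa": cohens_kappa(human, judge)}


# Illustrative labels only; real validation needs a properly sampled labelled set.
human_labels = ["pass", "pass", "fail", "pass", "fail", "fail", "pass", "pass"]
judge_labels = ["pass", "pass", "fail", "fail", "fail", "pass", "pass", "pass"]
report = agreement_report(human_labels, judge_labels)
print(report)
```

On this toy data the judge matches the human on 6 of 8 examples (raw agreement 0.75), but kappa comes out near 0.47 once chance agreement is removed. Reporting both numbers makes it harder to mistake a judge that mostly echoes the majority label for one that tracks human judgment.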
Lessons
Best external resources
Curated reading, docs, and tools that pair with this module.
- The essay hiring managers reference. The error-analysis workflow is the job. (Essay)
- Open-source tracing + evals; what Lab 07 wires into Labs 02 and 05. (Docs)
- Threat model, lethal trifecta, why filters aren't enough. (Essay)
- Know LLM01 (injection) cold; skim the rest for vocabulary. (Guide)
- Survey of judge techniques and their measured biases: the depth behind this module's judge table. (Essay)
- A game: extract a password from a defended LLM, level by level. The fastest way to build injection intuition. (Interactive)
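
To make the lethal-trifecta lens from the reading list concrete, the sketch below audits an agent's tool set for the three capabilities the term is commonly used to describe: access to private data, exposure to untrusted content, and the ability to communicate externally. Everything here is an assumption for illustration: the `Tool` dataclass, its capability flags, and the tool names are hypothetical, and the flags are self-declared rather than detected. A check like this is a design-review aid, not a defense on its own.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tool:
    """Hypothetical tool descriptor with self-declared capability flags."""
    name: str
    reads_private_data: bool = False
    ingests_untrusted_content: bool = False
    can_send_externally: bool = False


def trifecta_legs(tools):
    """Map each leg of the trifecta to the names of the tools that provide it."""
    return {
        "private_data": [t.name for t in tools if t.reads_private_data],
        "untrusted_content": [t.name for t in tools if t.ingests_untrusted_content],
        "external_communication": [t.name for t in tools if t.can_send_externally],
    }


def has_lethal_trifecta(tools):
    """True when at least one tool supplies each of the three legs."""
    return all(trifecta_legs(tools).values())


# Illustrative tool set: an inbox reader plus a web fetcher completes all three legs.
risky_agent = [
    Tool("read_inbox", reads_private_data=True, ingests_untrusted_content=True),
    Tool("web_fetch", ingests_untrusted_content=True, can_send_externally=True),
]
# Dropping the tool that can reach the outside world breaks the trifecta.
safer_agent = [t for t in risky_agent if not t.can_send_externally]

print(has_lethal_trifecta(risky_agent), trifecta_legs(risky_agent))
print(has_lethal_trifecta(safer_agent), trifecta_legs(safer_agent))
```

The point of listing which tools supply each leg is that the mitigation is usually structural: remove or gate one leg (for example, put external sends behind a human approval step) instead of relying on an input filter to catch every injected instruction.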