Engineering

Evals before features: how we ship LLM systems

Our internal playbook for shipping AI features safely — and why eval harnesses are the first PR, not the last.

Imran Ali · Principal Engineer · February 19, 2026 · 9 min read

There is a quiet pattern in every AI engagement we've successfully shipped: the eval harness was the first thing we built.

Want more?

Subscribe for one thoughtful essay a month — written by the engineers and designers doing the work.