Writing on cloud, DevOps, security, and AI engineering, informed by what actually goes wrong in production.
If you ship an LLM feature without evals, you're flying blind. Here's how to set up evaluations that actually catch regressions, in a few hundred lines of code.