Writing on cloud, DevOps, security, and AI engineering, informed by what actually goes wrong in production.
Token spend on LLM features can go non-linear quickly. Here are the patterns I use to keep Azure OpenAI bills predictable without compromising the experience.