Engineering notes on AI, ML, custom apps, and the messy bits of shipping production software.
A practical, ordered playbook for lowering production LLM costs: prompt hygiene, caching, model routing, retrieval, budgets, and the measurement that ties it all together.