AI Inference Economics

Training gets the headlines; inference gets the bill. Here's where I dig into the economics of running AI in production — when an API stops being cheaper than self-hosting, where inference should physically live, and how to stop the per-query cost from eating the margin.

20 items across talks, insights, writing, and media.