LLM Infrastructure
AI gateways, caching, model routing, observability, edge replication. Practitioner deep-dives from building Prism.
- Portkey vs Helicone vs LiteLLM vs OpenRouter: Honest Comparison Honest comparison of the 4 leading LLM gateways in 2026, plus where Prism enters as a new credible alternative. Updated for Fusion + edge replication.
- Anthropic Prompt Caching: Real Numbers From 330 Production Calls Real first-party data: Anthropic prompt caching cut Citare's AI bill 25-35% on parsing-heavy workloads. What works, what doesn't, what burned me $20.