LLM Infrastructure

AI gateways, caching, model routing, observability, edge replication. Practitioner deep-dives from building Prism.

Portkey vs Helicone vs LiteLLM vs OpenRouter: Honest Comparison
Honest comparison of the 4 leading LLM gateways in 2026, plus where Prism enters as a new credible alternative. Updated for Fusion + edge replication.
May 24, 2026

Anthropic Prompt Caching: Real Numbers From 330 Production Calls
Real first-party data: Anthropic prompt caching cut Citare's AI bill 25-35% on parsing-heavy workloads. What works, what doesn't, what burned me $20.
May 23, 2026