Tag: cost-optimization

1 article

May 17, 2026 · llm / cost-optimization

LLM cost routing: when Haiku beats Opus and when it does not

Routing 1M classification tokens from Opus 4.7 to Haiku 4.5 saves $6.00 — 80% reduction. Here is the task taxonomy, the latency case, and the tools to implement it.