· llm / cost-optimization
LLM cost routing: when Haiku beats Opus and when it does not
Routing 1M classification tokens from Opus 4.7 to Haiku 4.5 saves $6.00 — 80% reduction. Here is the task taxonomy, the latency case, and the tools to implement it.