Tag: anthropic

3 articles

· llm / cost-optimization

LLM cost routing: when Haiku beats Opus and when it does not

Routing 1M classification tokens from Opus 4.7 to Haiku 4.5 saves $6.00 — 80% reduction. Here is the task taxonomy, the latency case, and the tools to implement it.

· claude / haiku

Claude Haiku 4.5 for Coding — Benchmark and Cost Guide

At $1/1M tokens and 93 t/s, Haiku 4.5 is the right model for bounded coding — 73.3% SWE-bench Verified, 55% win rate on PR reviews. Here is the task split.

· claude / ai-tools

Claude Opus 4.7 for Coding — When the Big Model Wins

Opus 4.7 leads SWE-bench Verified at 87.6% and scores 70% on CursorBench vs. 58% for Opus 4.6. It costs ~2× Sonnet 4.6 after the tokenizer uplift. Here is exactly when it is worth it.