Tegy model cost switch

Replacing Qwen with Claude is mostly a 3.5x to 17.4x cost move.

This estimates the same token volume currently served by Qwen3.7 Plus on OpenRouter, priced against Anthropic's current Claude API models.

Token prices

Qwen3.7 Plus via OpenRouter

$0.32 / $1.28 input / output MTok

Claude Haiku 4.5

$1 / $5 input / output MTok

Claude Sonnet 4.6

$3 / $15 input / output MTok

Claude Opus 4.8

$5 / $25 input / output MTok

Same traffic, same mix

Qwen

1.0x

Haiku

3.5x

Sonnet

10.4x

Opus

17.4x

If the workload is output-heavy, the Anthropic premium rises slightly. If repeated context is cacheable, Anthropic prompt caching can materially reduce the input side.

Formula

Qwen = input_M * 0.32 + output_M * 1.28
Haiku = input_M * 1 + output_M * 5
Sonnet = input_M * 3 + output_M * 15
Opus = input_M * 5 + output_M * 25

Exact totals require the OpenRouter or Cloudflare AI Gateway usage export split by input and output tokens.

Budget examples

Qwen spend	Haiku	Sonnet	Opus
$10	$34.72	$104.17	$173.61
$25	$86.81	$260.42	$434.03
$50	$173.61	$520.83	$868.06

Decision read

Haiku Closest Anthropic cost option. Still roughly 3.5x Qwen for this agent-style mix.

Sonnet Likely quality upgrade path, but budget should assume about 10x the same Qwen traffic.

Opus Reserve for escalation, evals, or high-value turns unless caching changes the economics.