Intelligent LLM routing for Claude Code

Intelligent LLM routing.
Cut Claude Code costs by 70%

Drop-in proxy that automatically routes simple requests to cost-efficient models and complex ones to premium. One line to set up.

export ANTHROPIC_BASE_URL=https://api.inferroute.ai/v1

How it works

Connect your Claude Code setup in minutes

1

Set the URL

Point your Anthropic client at Inferroute. Two environment variables — your API key and the base URL. Nothing else changes.

2

Code normally

Use Claude Code exactly as before. Inferroute intercepts and classifies each request in real-time.

3

Save ~70%

Simple tasks get routed to cheaper models. Complex reasoning stays on premium. You save automatically.

Pricing

Pay per token. No surprises.

Plans include a monthly credit budget to spend at the rates below.

Current rates
~70% cheaper than direct
Input tokens
$0.20/M
$3.00/M direct
Output tokens
$0.50/M
$15.00/M direct

Free

$0
~500K tokens at current rates
  • 2 API keys
  • Cost-efficient
  • Community support
Get Started
Most Popular

Pro

$19/mo
$25/mo included
~91M tokens at current rates
Effective$0.15/M in·$0.38/M out
  • 10 API keys
  • All models
  • Priority support
Start Free Trial

Max

$49/mo
$70/mo included
~255M tokens at current rates
Effective$0.14/M in·$0.35/M out
  • Unlimited API keys
  • All + priority routing
  • Dedicated support
Start Free Trial

Need more? Top up your balance anytime from $5. Overages are billed at the rates above. Pay with card or USDC on Base (near-zero fees).

Ready to cut your costs?

Join developers who are saving thousands on their Claude Code bills. Set up in under a minute.

Free tier included. No credit card required.