Pay only for what you use.
No subscription. The open frontier — Kimi, GLM, MiniMax — sits well under market on every bucket: input, cache, and output. Claude (Anthropic) is pass-through on your own key, never marked up.
Open models, per token
vs OpenRouter · 2026-05-31Kimi K2.6
71–86% cheaper, end to end
GLM 5.1
84–92% cheaper, end to end
MiniMax M2.7
25–31% cheaper, end to end
All values in $/M tokens. Strikethrough is the cheapest non-InferRoute access (OpenRouter). Cached is the discounted rate for tokens served from a repeated prefix. * MiniMax M2.7 has no published cache rate, so OpenRouter's is imputed at its typical 20% input ratio.
Works with the Claude you already pay for. Sonnet & Opus pass through your own Anthropic subscription — never marked up, never touched. InferRoute just adds the cheaper open models on top.
Two ways to pay
One account, no subscription
Top up a balance and pay only for what you route — no monthly bill, stop anytime.
- No usage-cap anxiety — never ration prompts or get throttled
- Prepaid credits that never expire
- Top up by card or USDC — 0% fees in USDC
Pay-as-you-go in USDC
Approve once via the open x402 standard. Usage is metered per request and topped up in $1 USDC increments — batched on purpose, so you pay 0 network fees.
- $1 gasless top-ups, drawn down per request — 0% fees
- Non-custodial: you approve, funds stay yours until used
- Built for agents and machine-to-machine spend
Pay only for what you use. No subscription, no commitment.