Today we're launching Kunavo — an OpenAI-compatible AI API gateway built for indie developers and small teams shipping AI features to global users.

The pitch is short: one base URL, one bearer token, 200+ frontier models at 30% under official pricing. Claude, Gemini, GPT-Image, Veo, Sora, Flux, Suno — text, image, video, audio — all behind a single bill in your wallet.

Why another aggregator?

We've shipped enough AI features to know that the pain isn't the SDK. It's the operational tax around it: one contract per provider, per-model billing in different currencies, surprise rate limits on production weekends, "cheaper" aggregators that silently swap your model. Kunavo answers each of those directly.

Drop-in OpenAI compatibility. Streaming, function calling, tool use, vision and reasoning all work unmodified. Change one base_url.
30% under official across the catalog. Every enabled model is priced at roughly the provider's official list price minus 30%, with the per-1M-token rate published in plain text on /pricing. No hidden multipliers, failed requests free.
Stripe-native checkout. Cards, Apple Pay, Google Pay, ACH, SEPA, Alipay, WeChat Pay — anything Stripe supports. $2 free credit on sign-up, $5 minimum top-up, no subscription, balance never expires.
Multi-vendor hot failover. If an upstream goes down on the weekend, we re-route inside 50 ms — before your retry loop even fires.
Transparent routing. We never swap your model silently. The usage dashboard shows the exact upstream that served every call.

What you get on day one

50+ models live, including Claude Opus 4.7, Gemini 3 Pro, GPT-Image-2, Veo 3, Nano Banana, Flux 2 — full list at /models.
Native Anthropic Messages API alongside the OpenAI chat completions endpoint, so Claude users can keep their SDK.
Prompt caching on Claude with our affinity router keeping your cache warm — up to 90% off input tokens on repeat prompts.
A self-serve dashboard for keys, usage, billing, and an admin call stream that times every single request.

Three minutes to your first call

first_call.py

from openai import OpenAI

client = OpenAI(
    api_key="sk-kunavo-...",
    base_url="https://api.kunavo.com/v1",  # change only this line
)

# Same SDK. Different model.
resp = client.chat.completions.create(
    model="claude-sonnet-4-6",
    messages=[{"role": "user", "content": "Hello, Kunavo"}],
)
print(resp.choices[0].message.content)

What's next

On the roadmap for the next two months: an in-product migration assistant from OpenAI, Anthropic and OpenRouter; embeddings-aware token batching; and team accounts with per-key budgets and per-environment audit logs. We'll post each ship in /changelog.

Bugs and product feedback land in our inbox at contact@kunavo.com. We answer every email.

Launching Kunavo — one OpenAI-compatible API for every frontier model, 30% under official price

Why another aggregator?

What you get on day one

Three minutes to your first call

What's next

Related deep dives

Image models shootout — Nano Banana Pro vs Flux 2 Pro vs Seedream V4

Engineering a 99.95% SLO for an AI API gateway — failover, watchdogs, and the boring stuff

Anthropic prompt caching — cut 90% off your input bill in 30 minutes

Migrating from OpenAI to Kunavo in 10 minutes — Python, Node, LangChain, Vercel AI SDK