返回博客
Announcement·2026年5月23日·4 min read

Launching Kunavo — one OpenAI-compatible API for every frontier model, 30% under official price

Why we built a Stripe-native AI API gateway: lower than upstream pricing, one bill across Claude, Gemini, GPT-Image, Veo, and a $2 free credit to get you started.

Today we're launching Kunavo — an OpenAI-compatible AI API gateway built for indie developers and small teams shipping AI features to global users.

The pitch is short: one base URL, one bearer token, 200+ frontier models at 30% under official pricing. Claude, Gemini, GPT-Image, Veo, Sora, Flux, Suno — text, image, video, audio — all behind a single bill in your wallet.

Why another aggregator?

We've shipped enough AI features to know that the pain isn't the SDK. It's the operational tax around it: one contract per provider, per-model billing in different currencies, surprise rate limits on production weekends, "cheaper" aggregators that silently swap your model. Kunavo answers each of those directly.

  • Drop-in OpenAI compatibility. Streaming, function calling, tool use, vision and reasoning all work unmodified. Change one base_url.
  • 30% under official across the catalog. Every enabled model is priced at roughly the provider's official list price minus 30%, with the per-1M-token rate published in plain text on /pricing. No hidden multipliers, failed requests free.
  • Stripe-native checkout. Cards, Apple Pay, Google Pay, ACH, SEPA, Alipay, WeChat Pay — anything Stripe supports. $2 free credit on sign-up, $5 minimum top-up, no subscription, balance never expires.
  • Multi-vendor hot failover. If an upstream goes down on the weekend, we re-route inside 50 ms — before your retry loop even fires.
  • Transparent routing. We never swap your model silently. The usage dashboard shows the exact upstream that served every call.

What you get on day one

  • 50+ models live, including Claude Opus 4.7, Gemini 3 Pro, GPT-Image-2, Veo 3, Nano Banana, Flux 2 — full list at /models.
  • Native Anthropic Messages API alongside the OpenAI chat completions endpoint, so Claude users can keep their SDK.
  • Prompt caching on Claude with our affinity router keeping your cache warm — up to 90% off input tokens on repeat prompts.
  • A self-serve dashboard for keys, usage, billing, and an admin call stream that times every single request.

Three minutes to your first call

first_call.py
from openai import OpenAI

client = OpenAI(
    api_key="sk-kunavo-...",
    base_url="https://api.kunavo.com/v1",  # change only this line
)

# Same SDK. Different model.
resp = client.chat.completions.create(
    model="claude-sonnet-4-6",
    messages=[{"role": "user", "content": "Hello, Kunavo"}],
)
print(resp.choices[0].message.content)

Sign up at kunavo.com/app/signup — you get $2 in credits, enough to put every modality through its paces.

What's next

On the roadmap for the next two months: an in-product migration assistant from OpenAI, Anthropic and OpenRouter; embeddings-aware token batching; and team accounts with per-key budgets and per-environment audit logs. We'll post each ship in /changelog.

Bugs and product feedback land in our inbox at contact@kunavo.com. We answer every email.