Inference Gateway

One API, every model

Drop-in replacement for OpenAI SDK. Access Qwen, Llama, and more through a single endpoint with automatic fallback, usage tracking, and simple pay-as-you-go billing.

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.your-gateway.com/v1",
  apiKey: "sk-your-key",
});

const res = await client.chat.completions.create({
  model: "qwen3-235b",
  messages: [{ role: "user", content: "Hello!" }],
});