Guides

OpenAI-compatible relay

What is an OpenAI-compatible API relay?

An OpenAI-compatible API relay is a gateway that accepts familiar OpenAI-style requests and adds operational controls around them, such as prepaid balance checks, per-key limits, request logs, and routing policy.

Definition

A relay sits between an application and one or more model providers. The application keeps using OpenAI-compatible endpoints, while the relay handles key policy, metering, logging, and provider routing.

Why developers use a relay

Teams usually add a relay when direct provider access is not enough for cost visibility, team key management, or request-level audit trails.

  • Centralize API keys and model access
  • Stop requests when prepaid balance or budgets are exhausted
  • Export request-level usage for finance or support review

Relay vs direct provider

Direct provider access is simpler for a single developer. A relay is useful when multiple apps, teammates, or customers need shared policy, spend controls, and consistent logs.

FAQ

Does a relay replace the model provider?

No. It adds a gateway layer for policy, logs, routing, and billing controls.

Is OpenAI-compatible the same as OpenAI-owned?

No. It means the API shape is compatible. NexRelay is an independent third-party service.