What you get
- OpenAI-compatible
/v1/chat/completionsand/v1/embeddingsso your current SDKs work unchanged. - Governance built in — teams, virtual keys, and per-key rate limits.
- A single source of truth for the API: the OpenAPI 3.1 spec that this site renders directly.
Where to start
Quickstart
Run your first inference request in under five minutes.
Authentication
Bearer tokens, virtual keys, and admin keys.
API Reference
Every endpoint, every parameter — generated from the spec.
Errors
HTTP status codes and what to do about them.
This documentation site is in Dogfooding. The Phase 1 surface (Get
Started + API Reference) is live; the deeper Reference pages will fill in
during Alpha.