Changelog

v1 — Initial Release

Endpoints

  • POST /v1/chat/completions — OpenAI-compatible chat completions
  • POST /v1/responses — Stateful Responses API
  • POST /v1/web-search — Web search with structured results
  • POST /v1/fetch — Extract text content from URLs
  • GET /v1/models — List available models and pricing
  • GET /v1/license — Check license info and credit balance

Providers

  • OpenRouter — 300+ LLMs (default for chat completions, responses, models)
  • Anthropic — Claude models directly
  • Exa — Web search and content fetch (default)
  • Tavily — Web search and content fetch (alternative)

Features

  • OpenAI SDK compatibility
  • SSE streaming for chat completions and responses
  • Unified credit billing across all providers
  • JWT authentication with auto-provisioning
  • Per-institution configuration
  • Provider routing via X-Quantized-Provider header