Models
Models
GET /v1/models
List all available models with metering fields (input_cost_per_micro_credit, output_cost_per_micro_credit) used to calculate credit usage for your license.
Headers
| Header | Required | Description |
|---|---|---|
Authorization |
Yes | Bearer <api-key-or-jwt> |
X-Quantized-Provider |
No | Force a provider to list models from (openrouter or anthropic) |
Request
No request body. This is a GET endpoint.
Examples
cURL
Python
OpenAI SDK
curl https://api.quantized.us/v1/models \
-H "Authorization: Bearer sk-quantized-YOUR-KEY"
import httpx
response = httpx.get(
"https://api.quantized.us/v1/models",
headers={"Authorization": "Bearer sk-quantized-YOUR-KEY"},
)
models = response.json()
for model in models[:5]:
print(f"{model['id']}: input={model['cost']['input_cost_per_micro_credit']}, "
f"output={model['cost']['output_cost_per_micro_credit']}")
from openai import OpenAI
client = OpenAI(
api_key="sk-quantized-YOUR-KEY",
base_url="https://api.quantized.us/v1",
)
models = client.models.list()
for model in models.data[:5]:
print(model.id)
Response
[
{
"id": "openai/gpt-4.1-mini",
"name": "GPT-4.1 Mini",
"cost": {
"input_cost_per_micro_credit": 0.0016,
"output_cost_per_micro_credit": 0.0064
},
"context_window": 1047576,
"max_output_tokens": 32768
},
{
"id": "anthropic/claude-sonnet-4-20250514",
"name": "Claude Sonnet 4",
"cost": {
"input_cost_per_micro_credit": 0.048,
"output_cost_per_micro_credit": 0.24
},
"context_window": 200000,
"max_output_tokens": 16384
}
]
Response fields
| Field | Type | Description |
|---|---|---|
id |
string | Model identifier (use this in model field of requests) |
name |
string | Human-readable model name |
cost.input_cost_per_micro_credit |
number | Input-side rate in micro-credits per token for your license |
cost.output_cost_per_micro_credit |
number | Output-side rate in micro-credits per token for your license |
context_window |
integer or null | Maximum context length in tokens |
max_output_tokens |
integer or null | Maximum output length in tokens |
License-specific rates
The values returned apply to your license. Another organization may see different numbers for the same model id.
Errors
| Status | Condition |
|---|---|
401 |
Invalid or missing API key |
503 |
Model catalog provider unavailable |