Models

GET /v1/models

List all available models with metering fields (input_cost_per_micro_credit, output_cost_per_micro_credit) used to calculate credit usage for your license.

Headers

Header Required Description
Authorization Yes Bearer <api-key-or-jwt>
X-Quantized-Provider No Force a provider to list models from (openrouter or anthropic)

Request

No request body. This is a GET endpoint.

Examples

cURL
Python
OpenAI SDK
curl https://api.quantized.us/v1/models \
  -H "Authorization: Bearer sk-quantized-YOUR-KEY"
import httpx

response = httpx.get(
    "https://api.quantized.us/v1/models",
    headers={"Authorization": "Bearer sk-quantized-YOUR-KEY"},
)
models = response.json()
for model in models[:5]:
    print(f"{model['id']}: input={model['cost']['input_cost_per_micro_credit']}, "
          f"output={model['cost']['output_cost_per_micro_credit']}")
from openai import OpenAI

client = OpenAI(
    api_key="sk-quantized-YOUR-KEY",
    base_url="https://api.quantized.us/v1",
)

models = client.models.list()
for model in models.data[:5]:
    print(model.id)

Response

[
  {
    "id": "openai/gpt-4.1-mini",
    "name": "GPT-4.1 Mini",
    "cost": {
      "input_cost_per_micro_credit": 0.0016,
      "output_cost_per_micro_credit": 0.0064
    },
    "context_window": 1047576,
    "max_output_tokens": 32768
  },
  {
    "id": "anthropic/claude-sonnet-4-20250514",
    "name": "Claude Sonnet 4",
    "cost": {
      "input_cost_per_micro_credit": 0.048,
      "output_cost_per_micro_credit": 0.24
    },
    "context_window": 200000,
    "max_output_tokens": 16384
  }
]

Response fields

Field Type Description
id string Model identifier (use this in model field of requests)
name string Human-readable model name
cost.input_cost_per_micro_credit number Input-side rate in micro-credits per token for your license
cost.output_cost_per_micro_credit number Output-side rate in micro-credits per token for your license
context_window integer or null Maximum context length in tokens
max_output_tokens integer or null Maximum output length in tokens
License-specific rates

The values returned apply to your license. Another organization may see different numbers for the same model id.

Errors

Status Condition
401 Invalid or missing API key
503 Model catalog provider unavailable