Models
List the LLM models available for use with the Vexrail API. The response follows the OpenAI models list format.
Endpoint
GET /v1/models
Authentication
Requires x-publishable-key and x-secret-key headers. See Authentication.
Example Request
curl https://api.vexrail.com/v1/models \
-H "x-publishable-key: pk_live_your_key" \
-H "x-secret-key: sk_live_your_key"
Example Response
{
"object": "list",
"data": [
{
"id": "gpt-4o-mini",
"object": "model",
"created": 1700000000,
"owned_by": "openai"
},
{
"id": "gpt-4o",
"object": "model",
"created": 1700000000,
"owned_by": "openai"
},
{
"id": "claude-3-5-sonnet-20241022",
"object": "model",
"created": 1700000000,
"owned_by": "anthropic"
}
]
}
Model Selection
When making a chat completion request, specify the model using its id value from this endpoint.
{
"model": "gpt-4o-mini",
"messages": [...]
}
Model Availability
The list of available models may change over time as new models are added or deprecated. The models endpoint response is cached for 10 minutes, so newly added models may take up to 10 minutes to appear.
Choosing a Model
Different models have different capabilities, performance characteristics, and costs:
- Smaller models (e.g.,
gpt-4o-mini) are faster and cheaper per token. Good for simple queries, high-volume applications, or when cost is a priority. - Larger models (e.g.,
gpt-4o) provide higher quality responses for complex reasoning tasks but cost more per token.
Token costs are charged against your credit wallet.