Create an AI model configuration.
JWT token from Stytch B2B authentication (magic link, SSO, or M2M)
Schema for creating a new AI model configuration. Enterprise-grade validation with comprehensive field documentation for automatic OpenAPI generation.
ID of the system model to use as the base. User models must reference a system model to inherit pricing and capabilities.
1 - 36"550e8400-e29b-41d4-a716-446655440000"
Your model's human-readable name shown in dashboards and responses (e.g., 'AI models Omni', 'My Finance Model'). Use this to differentiate organization-specific or customized variants.
1 - 200"large-model Omni"
"reasoning-model 3.5 Sonnet"
"Custom Support Model"
Maximum tokens considered at once, including system prompt, message history, user input, tools/functions, and the model's output. Larger windows allow longer context but may increase cost/latency.
1 <= x <= 20000004096
8192
32768
128000
Upper bound for tokens the model may generate in a single response. If not specified, inherits from parent model. Keep this below contextWindow to leave room for prompts and history.
1 <= x <= 2000001000
Whether the provider/model supports SSE (Server-Sent Events) streaming responses.
Enable/disable this model for your organization without deleting it.
ID of a system prompt composition to use for this model. System prompts are composed from particles (role, tone, guardrails, etc.).
"550e8400-e29b-41d4-a716-446655440002"
Full system prompt override (escape hatch). If set, bypasses particle-based composition and uses this text directly.
50000Successful Response
Schema for AI model response data.
Unique identifier for the model
"550e8400-e29b-41d4-a716-446655440000"
Human-readable display name
"large-model Omni"
"reasoning-model 3.5 Sonnet"
Model's context window size in tokens
4096
128000
Whether the model supports streaming
true
Whether the model is enabled
true
Whether this is a company-provided base model
true
false
When the model was created
"2025-09-08T06:33:19Z"
When the model was last updated
"2025-09-08T06:33:19Z"
Model description
"Advanced language model for complex tasks"
AI provider name (NULL for user models, inherited from parent)
"provider-a"
Provider-specific model identifier (NULL for user models, inherited from parent)
"large-modelo"
Maximum tokens per request
4096
Whether the model supports tool/function calling
true
false
Whether the model supports vision/image inputs
true
false
Whether the model supports reasoning/thinking tokens (AI models extended thinking, o1, AI models thinking)
true
false
Credit multiplier for billing (system models only, user models inherit from parent)
1
ID of the parent system model (for user models)
"550e8400-e29b-41d4-a716-446655440001"
ID of the system prompt composition assigned to this model
"550e8400-e29b-41d4-a716-446655440002"
Full system prompt override (if set, bypasses particle composition)
"You are a helpful assistant."