Primary navigation

Legacy APIs

Compare models
Best intelligence at scale for agentic, coding, and professional workflows
Reasoning
Speed
Input
Output
Reasoning tokens
Pricing
Per 1M tokens
Input
$2.50
Cached Input
$0.25
Output
$15.00
Context
Window
1,050,000
Max Output Tokens
128,000
Knowledge Cutoff
Aug 31, 2025
Endpoints
v1/chat/completions
v1/responses
v1/assistants
v1/batch
v1/fine-tuning
Supported Features
Streaming
Function calling
Structured outputs
Fine-tuning
Distillation
Predicted outputs
Image input
Rate Limits
TPM
Free
-
Tier 1
500,000
Tier 2
1,000,000
Tier 3
2,000,000
Tier 4
4,000,000
Tier 5
40,000,000
Smartest non-reasoning model
Intelligence
Speed
Input
Output
Reasoning tokens
Pricing
Per 1M tokens
Input
$2.00
Cached Input
$0.50
Output
$8.00
Context
Window
1,047,576
Max Output Tokens
32,768
Knowledge Cutoff
Jun 01, 2024
Endpoints
v1/chat/completions
v1/responses
v1/assistants
v1/batch
v1/fine-tuning
Supported Features
Streaming
Function calling
Structured outputs
Fine-tuning
Distillation
Predicted outputs
Image input
Rate Limits
TPM
Free
-
Tier 1
30,000
Tier 2
450,000
Tier 3
800,000
Tier 4
2,000,000
Tier 5
30,000,000