AI Pricing Table

Compare API costs across all major AI providers in one comprehensive table

View detailed pricing for OpenAI, Claude, Gemini, and other AI providers in an easy-to-read table format. Compare input (prompts) and output (generated content) per 1 million tokens costs across different models to find the most cost-effective AI solution for your needs.

Prices are shown in USD per 1 million tokens. Token counts vary by model and language, but as a rough guide:
1 token ≈ 4 characters or ¾ of a word in English.

Last updated: March 2026

Model Prices
Per 1M tokens

Provider
Model
Context Window
Input Price
Output Price
Cached Input Price
Price Range
OpenAIGPT-5.4 Nano400K$0.200$1.250$0.020Low
OpenAIGPT-5.4 Mini400K$0.750$4.500$0.070Low
OpenAIGPT-5.4 Pro1.05M$30.000$180.000N/AHigh
OpenAIGPT-5.41.05M$2.500$15.000$0.250Mid
OpenAIGPT-5.3 Chat128K$1.750$14.000$0.170Mid
OpenAIGPT-5.3-Codex400K$1.750$14.000$0.170Mid
OpenAIGPT-5.2-Codex400K$1.750$14.000$0.170Mid
OpenAIGPT-5.2 Chat128K$1.750$14.000$0.170Mid
OpenAIGPT-5.2 Pro400K$21.000$168.000N/AHigh
OpenAIGPT-5.2400K$1.750$14.000$0.170Mid
OpenAIGPT-5.1-Codex-Max400K$1.250$10.000$0.130Mid
OpenAIGPT-5.1400K$1.250$10.000$0.130Mid
OpenAIGPT-5.1 Chat128K$1.250$10.000$0.130Mid
OpenAIGPT-5.1-Codex400K$1.250$10.000$0.130Mid
OpenAIGPT-5.1-Codex-Mini400K$0.250$2.000$0.030Low
OpenAIo3 Deep Research200K$10.000$40.000$2.500High
OpenAIo4 Mini Deep Research200K$2.000$8.000$0.500Mid
OpenAIGPT-5 Pro400K$15.000$120.000N/AHigh
OpenAIGPT-5 Codex400K$1.250$10.000$0.130Mid
OpenAIGPT-5 Chat128K$1.250$10.000$0.130Mid
OpenAIGPT-5400K$1.250$10.000$0.130Mid
OpenAIGPT-5 Mini400K$0.250$2.000$0.020Low
OpenAIGPT-5 Nano400K$0.050$0.400$0.010Low
OpenAIo3 Pro200K$20.000$80.000N/AHigh
OpenAIo4 Mini High200K$1.100$4.400$0.280Mid
OpenAIo3200K$2.000$8.000$0.500Mid
OpenAIo4 Mini200K$1.100$4.400$0.280Mid
OpenAIGPT-4.11.05M$2.000$8.000$0.500Mid
OpenAIGPT-4.1 Mini1.05M$0.400$1.600$0.100Low
OpenAIGPT-4.1 Nano1.05M$0.100$0.400$0.020Low
OpenAIo1-pro200K$150.000$600.000N/AHigh
OpenAIGPT-4o-mini Search Preview128K$0.150$0.600N/ALow
OpenAIGPT-4o Search Preview128K$2.500$10.000N/AMid
OpenAIo3 Mini High200K$1.100$4.400$0.550Mid
OpenAIo3 Mini200K$1.100$4.400$0.550Mid
OpenAIo1200K$15.000$60.000$7.500High
OpenAIGPT-4o-mini128K$0.150$0.600$0.070Low
OpenAIGPT-4o128K$2.500$10.000N/AMid
OpenAIGPT-4o (extended)128K$6.000$18.000N/AMid
GoogleGemini 3.1 Flash Lite Preview1.05M$0.250$1.500$0.020Low
GoogleGemini 3.1 Pro Preview Custom Tools1.05M$2.000$12.000$0.200Mid
GoogleGemini 3.1 Pro Preview1.05M$2.000$12.000$0.200Mid
GoogleGemini 3 Flash Preview1.05M$0.500$3.000$0.050Low
GoogleGemini 2.5 Flash Lite Preview 09-20251.05M$0.100$0.400$0.010Low
GoogleGemini 2.5 Flash Lite1.05M$0.100$0.400$0.010Low
GoogleGemini 2.5 Flash1.05M$0.300$2.500$0.030Low
GoogleGemini 2.5 Pro1.05M$1.250$10.000$0.130Mid
GoogleGemini 2.5 Pro Preview 06-051.05M$1.250$10.000$0.130Mid
GoogleGemini 2.5 Pro Preview 05-061.05M$1.250$10.000$0.130Mid
AnthropicClaude Opus 4.71M$5.000$25.000$0.500Mid
AnthropicClaude Opus 4.6 (Fast)1M$30.000$150.000$3.000High
AnthropicClaude Sonnet 4.61M$3.000$15.000$0.300Mid
AnthropicClaude Opus 4.61M$5.000$25.000$0.500Mid
AnthropicClaude Opus 4.5200K$5.000$25.000$0.500Mid
AnthropicClaude Haiku 4.5200K$1.000$5.000$0.100Mid
AnthropicClaude Sonnet 4.51M$3.000$15.000$0.300Mid
AnthropicClaude Opus 4.1200K$15.000$75.000$1.500High
AnthropicClaude Opus 4200K$15.000$75.000$1.500High
AnthropicClaude Sonnet 41M$3.000$15.000$0.300Mid
AnthropicClaude 3.5 Haiku200K$0.800$4.000$0.080Low
AnthropicClaude 3 Haiku200K$0.250$1.250$0.030Low
xAIGrok 4.20 Multi-Agent2M$2.000$6.000$0.200Mid
xAIGrok 4.202M$2.000$6.000$0.200Mid
xAIGrok 4.1 Fast2M$0.200$0.500$0.050Low
xAIGrok 4 Fast2M$0.200$0.500$0.050Low
xAIGrok Code Fast 1256K$0.200$1.500$0.020Low
xAIGrok 4256K$3.000$15.000$0.750Mid
xAIGrok 3 Mini131K$0.300$0.500$0.070Low
xAIGrok 3131K$3.000$15.000$0.750Mid
xAIGrok 3 Mini Beta131K$0.300$0.500$0.070Low
xAIGrok 3 Beta131K$3.000$15.000$0.750Mid
MistralMistral Small 4262K$0.150$0.600$0.010Low
MistralDevstral 2 2512262K$0.400$2.000$0.040Low
MistralMinistral 3 14B 2512262K$0.200$0.200$0.020Low
MistralMinistral 3 8B 2512262K$0.150$0.150$0.010Low
MistralMinistral 3 3B 2512131K$0.100$0.100$0.010Low
MistralMistral Large 3 2512262K$0.500$1.500$0.050Low
MistralMistral Medium 3.1131K$0.400$2.000$0.040Low
MistralCodestral 2508256K$0.300$0.900$0.030Low
MistralDevstral Medium131K$0.400$2.000$0.040Low
MistralDevstral Small 1.1131K$0.100$0.300$0.010Low
MistralMistral Small 3.2 24B128K$0.070$0.200N/ALow
MistralMistral Medium 3131K$0.400$2.000$0.040Low
MistralMistral Small 3.1 24B128K$0.350$0.560N/ALow
MistralSaba33K$0.200$0.600$0.020Low
MistralMistral Small 333K$0.050$0.080N/ALow
MistralMistral Large 2411131K$2.000$6.000$0.200Mid
MistralMistral Large 2407131K$2.000$6.000$0.200Mid
MistralPixtral Large 2411131K$2.000$6.000$0.200Mid
MistralMistral Nemo131K$0.020$0.040N/ALow
MistralMixtral 8x22B Instruct66K$2.000$6.000$0.200Mid
MistralMistral Large128K$2.000$6.000$0.200Mid
MistralMixtral 8x7B Instruct33K$0.540$0.540N/ALow
MistralMistral 7B Instruct v0.13K$0.110$0.190N/ALow
AlibabaQwen3.6 Plus1M$0.330$1.950N/ALow
AlibabaQwen3.5-9B262K$0.100$0.150N/ALow
AlibabaQwen3.5-35B-A3B262K$0.160$1.300N/ALow
AlibabaQwen3.5-27B262K$0.200$1.560N/ALow
AlibabaQwen3.5-122B-A10B262K$0.260$2.080N/ALow
AlibabaQwen3.5-Flash1M$0.070$0.260N/ALow
AlibabaQwen3.5 Plus 2026-02-151M$0.260$1.560N/ALow
AlibabaQwen3.5 397B A17B262K$0.390$2.340$0.200Low
AlibabaQwen3 Max Thinking262K$0.780$3.900N/ALow
AlibabaQwen3 Coder Next262K$0.150$0.800$0.120Low
AlibabaQwen3 VL 32B Instruct131K$0.100$0.420N/ALow
AlibabaQwen3 VL 8B Thinking131K$0.120$1.360N/ALow
AlibabaQwen3 VL 8B Instruct131K$0.080$0.500N/ALow
AlibabaQwen3 VL 30B A3B Thinking131K$0.130$1.560N/ALow
AlibabaQwen3 VL 30B A3B Instruct131K$0.130$0.520N/ALow
AlibabaQwen3 VL 235B A22B Thinking131K$0.260$2.600N/ALow
AlibabaQwen3 VL 235B A22B Instruct262K$0.200$0.880$0.110Low
AlibabaQwen3 Max262K$0.780$3.900$0.160Low
AlibabaQwen3 Coder Plus1M$0.650$3.250$0.130Low
AlibabaQwen3 Coder Flash1M$0.200$0.970$0.040Low
AlibabaQwen3 Next 80B A3B Thinking131K$0.100$0.780N/ALow
AlibabaQwen3 Next 80B A3B Instruct (free)262K$0.000$0.000N/ALow
AlibabaQwen3 Next 80B A3B Instruct262K$0.090$1.100N/ALow
AlibabaQwen Plus 0728 (thinking)1M$0.260$0.780N/ALow
AlibabaQwen Plus 07281M$0.260$0.780N/ALow
AlibabaQwen3 30B A3B Thinking 2507131K$0.080$0.400$0.080Low
AlibabaQwen3 Coder 30B A3B Instruct160K$0.070$0.270N/ALow
AlibabaQwen3 30B A3B Instruct 2507262K$0.090$0.300N/ALow
AlibabaQwen3 235B A22B Thinking 2507262K$0.130$0.600N/ALow
AlibabaQwen3 Coder 480B A35B (free)262K$0.000$0.000N/ALow
AlibabaQwen3 Coder 480B A35B262K$0.220$1.000$0.020Low
AlibabaQwen3 235B A22B Instruct 2507262K$0.070$0.100N/ALow
AlibabaQwen3 30B A3B41K$0.080$0.280N/ALow
AlibabaQwen3 8B41K$0.050$0.400$0.050Low
AlibabaQwen3 14B41K$0.060$0.240N/ALow
AlibabaQwen3 32B41K$0.080$0.240$0.040Low
AlibabaQwen3 235B A22B131K$0.450$1.820N/ALow
AlibabaQwen2.5 VL 32B Instruct128K$0.200$0.600N/ALow
AlibabaQwen VL Plus131K$0.140$0.410$0.030Low
AlibabaQwen VL Max131K$0.520$2.080N/ALow
AlibabaQwen-Turbo131K$0.030$0.130$0.010Low
AlibabaQwen2.5 VL 72B Instruct32K$0.250$0.750N/ALow
AlibabaQwen-Plus1M$0.260$0.780$0.050Low
AlibabaQwen-Max33K$1.040$4.160$0.210Mid
AlibabaQwen2.5 Coder 32B Instruct33K$0.660$1.000N/ALow
AlibabaQwen2.5 7B Instruct33K$0.040$0.100N/ALow
AlibabaQwen2.5 72B Instruct33K$0.120$0.390N/ALow
DeepSeekDeepSeek-V3.2 (Non-thinking)128K$0.280$0.420$0.028Low
DeepSeekDeepSeek-V3.2 (Thinking Mode)128K$0.280$0.420$0.028Low
Vercelv0 Base128K$0.500$2.500$0.050Low
Vercelv0 Mini128K$1.000$5.000$0.100Mid
Vercelv0 Pro200K$3.000$15.000$0.300High
Vercelv0 Max200K$5.000$25.000$0.500Mid
GroqGPT OSS 20B128K$0.075$0.300$0.037Low
GroqGPT OSS Safeguard 20B128K$0.075$0.300$0.037Low
GroqGPT OSS 120B128K$0.150$0.600$0.075Low
GroqKimi K2-0905256K$1.000$3.000$0.500Mid
GroqLlama 4 Scout128K$0.110$0.340$0.055Low
GroqLlama 4 Maverick128K$0.200$0.600$0.100Low
GroqQwen3 32B131K$0.290$0.590$0.145Low
GroqLlama 3.3 70B Versatile128K$0.590$0.790$0.295Low
GroqLlama 3.1 8B Instant128K$0.050$0.080$0.025Low
Together AIGLM-5128K$1.000$3.200$0.500Mid
Together AIGLM-4.7128K$0.450$2.000$0.220Low
Together AIGLM-4.5-Air128K$0.200$1.100$0.100Low
Together AIMiniMax M2.5228K$0.300$1.200$0.150Low
Together AIKimi K2.5256K$0.500$2.800$0.250Low
Together AIKimi K2 Instruct256K$1.000$3.000$0.500Mid
Together AIKimi K2 Thinking256K$1.200$4.000$0.600Mid
PerplexitySonar Pro Search200K$3.000$15.000N/AMid
PerplexitySonar Reasoning Pro128K$2.000$8.000N/AMid
PerplexitySonar Pro200K$3.000$15.000N/AMid
PerplexitySonar Deep Research128K$2.000$8.000N/AMid
PerplexitySonar127K$1.000$1.000N/AMid
CohereCommand R128K$0.150$0.600N/ALow
CohereCommand R+128K$2.500$10.000N/AMid
AmazonNova Micro128K$0.040$0.140N/ALow
AmazonNova Lite300K$0.060$0.240N/ALow
AmazonNova Pro300K$0.800$3.200N/ALow
MicrosoftPhi-416K$0.070$0.140N/ALow
MicrosoftPhi-3.5 Mini128K$0.130$0.500N/ALow

Understanding AI Pricing & Terminology

What are Tokens?

Tokens are the basic units of text that AI models process. They represent pieces of words, not entire words. For example, the word "unhappiness" might be broken into the tokens "un", "happiness".

Input Tokens

These are the tokens in your prompt or question to the AI. Input tokens include:

  • Your instructions to the AI
  • Context information you provide
  • Examples you include
  • System messages defining AI behavior

Output Tokens

These are the tokens in the AI‘s response. Output tokens usually cost more than input tokens because:

  • They require more computational work
  • The model must make predictions for each token
  • They represent the AI‘s unique "work product"

How Many Tokens in Text?

As a general rule of thumb:

  • 1 token ≈ 4 characters in English text
  • 1 token ≈ ¾ of a word in English
  • 100 tokens ≈ 75 words or ≈ 1 paragraph
  • 1,000 tokens ≈ 750 words or ≈ 1 page
  • 1M tokens ≈ 750,000 words or ≈ 1,500 pages

Disclaimers

  • Prices may vary based on enterprise agreements and volume discounts.
  • Prices are subject to change without notice. Always check the official pricing pages of providers.
  • Context lengths and capabilities may vary for different use cases and implementations.
  • This information is provided for reference only and should not be considered financial advice.

Understanding AI Model Pricing

AI model pricing is typically based on token consumption, where tokens represent chunks of text processed by the model. Understanding how pricing works helps you optimize costs and choose the right model for your specific use case.

Token Basics

  • • 1 token ≈ 4 characters or ¾ of a word in English
  • • Different languages have varying token densities
  • • Code and special characters may use more tokens
  • • Tokens include both input (prompts) and output (responses)

Cost Calculation

  • • Input tokens: Text you send to the AI
  • • Output tokens: Text the AI generates
  • • Total cost = (Input tokens × Input rate) + (Output tokens × Output rate)
  • • Rates are typically shown per 1 million tokens

Model Complexity

More sophisticated models with larger parameter counts typically cost more per token.

Basic Models
$0.50-2/1M
Advanced Models
$3-15/1M
Premium Models
$15-60/1M

Context Windows

Larger context windows allow more information but may increase costs.

4K-8K tokens
Standard
32K-128K tokens
Extended
1M+ tokens
Long Context

Usage Patterns

Your input/output ratio affects total costs significantly.

High Input/Low Output
Analysis
Low Input/High Output
Generation
Balanced Usage
Chat

Cost Optimization Strategies

Input Optimization

  • • Use concise, clear prompts to minimize input tokens
  • • Avoid repetitive instructions within conversations
  • • Leverage system messages for context setting
  • • Consider prompt templates for consistency
  • • Use cached inputs when available for repeated queries

Output Management

  • • Set maximum token limits for generated responses
  • • Request specific formats (bullet points vs paragraphs)
  • • Use stop sequences to control output length
  • • Consider streaming for better user experience
  • • Choose models based on your output quality needs

Key Provider Differences

OpenAI

  • • Wide range of models from basic to advanced
  • • Strong performance across various tasks
  • • Regular model updates and improvements
  • • Comprehensive API documentation

Anthropic (Claude)

  • • Focus on safety and helpful responses
  • • Large context windows (up to 200K tokens)
  • • Strong performance on reasoning tasks
  • • Constitutional AI approach

Google (Gemini)

  • • Multimodal capabilities (text, image, audio)
  • • Competitive pricing for high-volume usage
  • • Integration with Google Cloud services
  • • Strong performance on technical tasks

Current Pricing Trends

Downward Trends

  • • Overall token costs decreasing as technology improves
  • • More competitive pricing due to market competition
  • • Introduction of tiered pricing for different use cases
  • • Better price/performance ratios with newer models

Market Dynamics

  • • Premium features (larger context, multimodal) command higher prices
  • • Volume discounts available for enterprise customers
  • • Cached inputs reducing costs for repeated queries
  • • Regional pricing variations emerging

Choosing the Right Model for Your Budget

Budget-Conscious
Low-Cost Applications

For high-volume, simple tasks where cost efficiency is paramount.

  • • Basic text processing and classification
  • • Simple Q&A systems
  • • Content moderation
  • • Data extraction from structured text

Balanced
General Purpose Applications

For most business applications requiring good quality and reasonable costs.

  • • Customer support chatbots
  • • Content generation and editing
  • • Code assistance and debugging
  • • Educational applications

Premium
High-Quality Applications

For critical applications where quality justifies higher costs.

  • • Complex reasoning and analysis
  • • Creative writing and content creation
  • • Research and technical documentation
  • • Multi-step problem solving

Estimating Your Costs

Use these guidelines to estimate your monthly AI costs based on usage patterns:

Light Usage
$10-50
  • ~100K tokens/month
  • Personal projects
  • Small applications
  • Testing and development
Moderate Usage
$100-500
  • ~1M tokens/month
  • Small business applications
  • Customer support bots
  • Content generation
Heavy Usage
$500+
  • 10M+ tokens/month
  • Enterprise applications
  • High-volume processing
  • 24/7 production systems

Real-World Cost Examples

Customer Support Chatbot

Business
$105/month

A moderate-volume customer support bot handling 1,000 conversations per day

Usage Pattern:
  • • Avg Input: 150 tokens
  • • Avg Output: 100 tokens
  • • Volume: 1000 interactions/day
  • • Model: GPT-4o
Cost Breakdown:

Monthly Input: 4.50M tokens

Monthly Output: 3.00M tokens

Total: $105

Content Generation Tool

Creative
$1,170/month

A content creation tool generating 50 blog posts per month

Usage Pattern:
  • • Avg Input: 300 tokens
  • • Avg Output: 1500 tokens
  • • Volume: 50 interactions/day
  • • Model: Claude Sonnet
Cost Breakdown:

Monthly Input: 0.01M tokens

Monthly Output: 0.07M tokens

Total: $1,170

Code Assistant

Development
$5.94/month

A development team using AI for code review and suggestions

Usage Pattern:
  • • Avg Input: 500 tokens
  • • Avg Output: 200 tokens
  • • Volume: 200 interactions/day
  • • Model: GPT-4o mini
Cost Breakdown:

Monthly Input: 2.20M tokens

Monthly Output: 0.88M tokens

Total: $5.94

Best Models by Use Case

High Input, Low Output

Document analysis and summarization

Example: Analyzing 10-page documents → 2-paragraph summaries
Recommended Models:
Claude Haiku
Low input cost, efficient processing
GPT-4o mini
Best value for analysis tasks

Tip: Focus on models with low input pricing

Low Input, High Output

Creative writing and content generation

Example: Short prompts → Long-form articles
Recommended Models:
GPT-4o
Good output quality, reasonable output pricing
Gemini Pro
Competitive output rates

Tip: Prioritize models with good output pricing and quality

Balanced Usage

Interactive conversations and Q&A

Example: Chat applications with moderate exchanges
Recommended Models:
Claude Sonnet
Balanced pricing, good conversation quality
GPT-4o
Reliable performance across use cases

Tip: Consider overall cost per conversation

Monthly Budget Planning Guide

Monthly Budget:
$50

Recommended Strategy:
  • GPT-4o mini for most tasks (~300K tokens)
  • Claude Haiku for document processing
  • Perfect for personal projects and prototyping
What You Can Achieve:
  • 50 detailed conversations
  • 100 code reviews
  • 25 document summaries

Monthly Budget:
$200

Recommended Strategy:
  • Mix of GPT-4o and GPT-4o mini (~100K quality tokens)
  • Claude Sonnet for complex reasoning
  • Good for small business applications
What You Can Achieve:
  • 500 customer interactions
  • 100 content pieces
  • 1,000 code assists

Monthly Budget:
$1,000

Recommended Strategy:
  • GPT-4o for critical tasks (~100K tokens)
  • Claude Sonnet for reasoning (~200K tokens)
  • GPT-4o mini for high-volume tasks (~5M tokens)
What You Can Achieve:
  • 5,000 support tickets
  • 200 articles
  • 10,000 code reviews

Advanced Cost Optimization Tips

Cost-Saving Strategies

  • Model Switching: Use cheaper models for simple tasks, premium for complex ones
  • Batch Processing: Group similar requests to reduce overhead
  • Caching: Store and reuse responses for common queries
  • Prompt Engineering: Optimize prompts to reduce token usage
  • Output Limits: Set max_tokens to control response length

Monitoring and Optimization

  • Usage Analytics: Track token consumption patterns
  • A/B Testing: Compare model performance vs cost
  • Regular Reviews: Reassess model choices quarterly
  • Alert Systems: Set up budget alerts for cost control
  • ROI Analysis: Measure value generated per dollar spent

Industry-Specific Cost Examples

E-commerce

  • Product descriptions: $200-500/month
  • Customer service: $300-800/month
  • Review analysis: $100-300/month

Focus on Claude Haiku for descriptions, GPT-4o mini for support

Software Development

  • Code generation: $400-1000/month
  • Documentation: $200-500/month
  • Code review: $100-400/month

Mix GPT-4o for complex logic, GPT-4o mini for routine tasks

Content & Marketing

  • Blog writing: $500-1500/month
  • Social media: $200-600/month
  • Ad copy: $300-800/month

Claude Sonnet for long-form, GPT-4o for creative campaigns