AI Pricing Table

Compare API costs across all major AI providers in one comprehensive table

View detailed pricing for OpenAI, Claude, Gemini, and other AI providers in an easy-to-read table. Compare input (prompt) and output (generated content) costs per 1 million tokens across models to find the most cost-effective option for your needs.

Prices are shown in USD per 1 million tokens. Token counts vary by model and language, but as a rough guide:
1 token ≈ 4 characters or ¾ of a word in English.

Last updated: May 2025

Model Prices
Per 1M tokens

Provider | Model | Context Window | Input Price | Output Price | Cached Input Price | Price Range
--- | --- | --- | --- | --- | --- | ---
Google | Gemma 3 | N/A | $0.000 | $0.000 | $0.000 | Low
Alibaba | Qwen-Turbo | 1M | $0.050 | $0.200 | $0.020 | Low
DeepSeek | DeepSeek-Chat | 64K | $0.070 | $1.100 | $0.035 | Low
Google | Gemini 2.0 Flash-Lite | 1M | $0.075 | $0.300 | N/A | Low
Google | Gemini 1.5 Flash (≤128K) | 1M | $0.075 | $0.300 | $0.019 | Low
Google | Gemini 2.0 Flash | 1M | $0.100 | $0.400 | $0.025 | Low
DeepSeek | DeepSeek-Reasoner | 64K | $0.140 | $2.190 | $0.035 | Mid
OpenAI | GPT-4o mini | 128K | $0.150 | $0.600 | $0.075 | Low
Google | Gemini 1.5 Flash (>128K) | 1M | $0.150 | $0.600 | $0.037 | Low
Alibaba | Qwen-Plus | 131K | $0.400 | $1.200 | $0.120 | Low
Anthropic | Claude Haiku 3.5 | 200K | $0.800 | $4.000 | N/A | Mid
OpenAI | o3-mini | 200K | $1.100 | $4.400 | $0.550 | Mid
Google | Gemini 1.5 Pro | 1M | $1.250 | $5.000 | N/A | Mid
Alibaba | Qwen-Max | 32K | $1.600 | $6.400 | $0.640 | Mid
OpenAI | GPT-4o | 128K | $2.500 | $10.000 | $1.250 | Mid
Anthropic | Claude Sonnet 3.7 | 200K | $3.000 | $15.000 | N/A | Mid
Anthropic | Claude Opus 3 | 200K | $15.000 | $75.000 | N/A | High
OpenAI | o1 Frontier | 200K | $15.000 | $60.000 | $7.500 | High
OpenAI | GPT-4.5 | 128K | $75.000 | $150.000 | $37.500 | High
OpenAI | GPT-4.1 | 1M | $2.000 | $8.000 | $0.500 | Mid
OpenAI | GPT-4.1 mini | 1M | $0.400 | $1.600 | $0.100 | Low
OpenAI | GPT-4.1 nano | 1M | $0.100 | $0.400 | $0.025 | Low
OpenAI | GPT-4.1 (Fine-tuning) | 1M | $3.000 | $12.000 | $0.750 | Mid
OpenAI | GPT-4.1 mini (Fine-tuning) | 1M | $0.800 | $3.200 | $0.200 | Mid
OpenAI | GPT-4o (Fine-tuning) | 1M | $3.750 | $15.000 | $1.875 | Mid
xAI | Grok 3 | 131K | $3.000 | $15.000 | N/A | Mid
xAI | Grok 3 (Fast) | 131K | $5.000 | $25.000 | N/A | High
xAI | Grok 3 Mini | 131K | $0.300 | $0.500 | N/A | Low
xAI | Grok 3 Mini (Fast) | 131K | $0.600 | $4.000 | N/A | Mid
xAI | grok-2-vision-1212 | 131K | $1.500 | $6.000 | $0.750 | Mid
Mistral | Mistral Large | 32K | $8.000 | $24.000 | N/A | High
Mistral | Mistral Medium | 32K | $2.500 | $7.500 | N/A | Mid
Mistral | Mistral Small | 32K | $0.200 | $0.600 | N/A | Low
Google | Gemini 2.5 Flash Preview | 1M | $0.150 | $0.600 | $0.037 | Low
Google | Gemini 2.5 Flash Preview (Thinking) | 1M | $0.150 | $3.500 | $0.037 | Low
Google | Gemini 2.5 Pro Preview | 1M | $1.250 | $10.000 | $0.310 | Mid
Vercel | v0-1.0-md | 1M | $3.000 | $15.000 | N/A | Mid
Anthropic | Claude Sonnet 3.5 | 200K | $3.000 | $15.000 | N/A | Mid
Anthropic | Claude Sonnet 4 | 200K | $3.000 | $15.000 | N/A | Mid
Anthropic | Claude Opus 4 | 200K | $15.000 | $75.000 | N/A | High

Understanding AI Pricing & Terminology

What are Tokens?

Tokens are the basic units of text that AI models process. A token is often a piece of a word rather than a whole word, although short, common words are frequently a single token. For example, the word "unhappiness" might be broken into the tokens "un" and "happiness".

Input Tokens

These are the tokens in your prompt or question to the AI. Input tokens include:

  • Your instructions to the AI
  • Context information you provide
  • Examples you include
  • System messages defining AI behavior

Output Tokens

These are the tokens in the AI's response. Output tokens usually cost more than input tokens because:

  • They require more computational work
  • The model generates them one at a time, with a prediction step for each token, whereas input tokens are processed together
  • They represent the AI's unique "work product"

How Many Tokens in Text?

As a general rule of thumb:

  • 1 token ≈ 4 characters in English text
  • 1 token ≈ ¾ of a word in English
  • 100 tokens ≈ 75 words or ≈ 1 paragraph
  • 1,000 tokens ≈ 750 words or ≈ 1 page
  • 1M tokens ≈ 750,000 words or ≈ 1,000 pages
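
The rules of thumb above can be turned into a quick estimator. The sketch below is only a heuristic for English text: it averages the "4 characters" and "¾ of a word" rules, and the function name `estimate_tokens` is made up for illustration. A provider's real tokenizer will give different counts.

```python
def estimate_tokens(text: str) -> int:
    """Rough English-only token estimate; real tokenizers will differ."""
    by_chars = len(text) / 4              # 1 token ≈ 4 characters
    by_words = len(text.split()) / 0.75   # 1 token ≈ ¾ of a word
    # Average the two heuristics and round to a whole token count.
    return round((by_chars + by_words) / 2)


sample = "Compare API costs across all major AI providers in one table"
print(estimate_tokens(sample))
```

For billing-grade numbers, count tokens with the provider's own tokenizer or read the usage figures returned with each API response.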

Disclaimers

  • Prices may vary based on enterprise agreements and volume discounts.
  • Prices are subject to change without notice. Always check the official pricing pages of providers.
  • Context lengths and capabilities may vary for different use cases and implementations.
  • This information is provided for reference only and should not be considered financial advice.

Understanding AI Model Pricing

AI model pricing is typically based on token consumption, where tokens represent chunks of text processed by the model. Understanding how pricing works helps you optimize costs and choose the right model for your specific use case.

Token Basics

  • 1 token ≈ 4 characters or ¾ of a word in English
  • Different languages have varying token densities
  • Code and special characters may use more tokens
  • Both input (prompts) and output (responses) are counted in tokens

Cost Calculation

  • Input tokens: Text you send to the AI
  • Output tokens: Text the AI generates
  • Total cost = (Input tokens × Input rate) + (Output tokens × Output rate)
  • Rates are typically shown per 1 million tokens
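
As a minimal sketch of the formula above, the function below takes token counts and per-1M rates and returns the cost in USD. The name `request_cost` and the example token counts are illustrative assumptions; the GPT-4o rates come from the table on this page.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_rate: float, output_rate: float) -> float:
    """Cost in USD; both rates are USD per 1 million tokens."""
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


# GPT-4o rates from the table: $2.50 input, $10.00 output per 1M tokens.
cost = request_cost(1_500, 500, input_rate=2.50, output_rate=10.00)
print(f"${cost:.5f}")  # $0.00875
```

Dividing by 1,000,000 once at the end keeps the rates in the same unit the table uses.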

Model Complexity

More sophisticated models with larger parameter counts typically cost more per token.

Basic Models: $0.50-2 per 1M tokens
Advanced Models: $3-15 per 1M tokens
Premium Models: $15-60 per 1M tokens

Context Windows

Larger context windows let you send more information per request, but every token you place in the context is billed as input, so filling them raises costs.

Standard: 4K-8K tokens
Extended: 32K-128K tokens
Long Context: 1M+ tokens

Usage Patterns

Your input/output ratio affects total costs significantly.

High Input/Low Output: Analysis
Low Input/High Output: Generation
Balanced Usage: Chat
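
To see how much the ratio matters, the sketch below prices the same total volume under three different input shares. The 1M-token monthly volume and the 90/50/10 percent input shares are hypothetical values chosen for illustration; the rates are GPT-4o's from the table.

```python
INPUT_RATE, OUTPUT_RATE = 2.50, 10.00  # GPT-4o, USD per 1M tokens (from the table)
TOTAL_TOKENS = 1_000_000               # hypothetical monthly volume

# Hypothetical share of tokens that are input for each usage pattern.
patterns = {"Analysis": 0.9, "Chat": 0.5, "Generation": 0.1}


def blended_cost(input_share: float) -> float:
    """USD cost of TOTAL_TOKENS split between input and output."""
    input_tokens = TOTAL_TOKENS * input_share
    output_tokens = TOTAL_TOKENS - input_tokens
    return (input_tokens * INPUT_RATE + output_tokens * OUTPUT_RATE) / 1_000_000


for name, share in patterns.items():
    print(f"{name}: ${blended_cost(share):.2f}")
```

Under these assumptions the output-heavy pattern costs nearly three times as much as the input-heavy one for the same number of tokens.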

Cost Optimization Strategies

Input Optimization

  • Use concise, clear prompts to minimize input tokens
  • Avoid repetitive instructions within conversations
  • Leverage system messages for context setting
  • Consider prompt templates for consistency
  • Use cached inputs when available for repeated queries
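
The effect of cached inputs can be estimated from the Cached Input Price column. The sketch below assumes the cached part of a prompt is billed at the cached rate and the rest at the normal input rate; how caching is triggered and billed differs between providers, so treat `input_cost` and the 10,000/8,000 token split as illustrative only.

```python
def input_cost(prompt_tokens: int, cached_tokens: int,
               input_rate: float, cached_rate: float) -> float:
    """USD cost of one prompt when `cached_tokens` of it are served from cache."""
    fresh_tokens = prompt_tokens - cached_tokens
    return (fresh_tokens * input_rate + cached_tokens * cached_rate) / 1_000_000


# GPT-4o from the table: $2.50 input, $1.25 cached input per 1M tokens.
without_cache = input_cost(10_000, 0, 2.50, 1.25)
with_cache = input_cost(10_000, 8_000, 2.50, 1.25)
print(f"${without_cache:.3f} vs ${with_cache:.3f}")  # $0.025 vs $0.015
```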

Output Management

  • Set maximum token limits for generated responses
  • Request specific formats (bullet points vs paragraphs)
  • Use stop sequences to control output length
  • Consider streaming for better user experience
  • Choose models based on your output quality needs

Key Provider Differences

OpenAI

  • Wide range of models from basic to advanced
  • Strong performance across various tasks
  • Regular model updates and improvements
  • Comprehensive API documentation

Anthropic (Claude)

  • Focus on safety and helpful responses
  • Large context windows (up to 200K tokens)
  • Strong performance on reasoning tasks
  • Constitutional AI approach

Google (Gemini)

  • Multimodal capabilities (text, image, audio)
  • Competitive pricing for high-volume usage
  • Integration with Google Cloud services
  • Strong performance on technical tasks

Current Pricing Trends

Downward Trends

  • Overall token costs decreasing as technology improves
  • Lower prices driven by competition between providers
  • Introduction of tiered pricing for different use cases
  • Better price/performance ratios with newer models

Market Dynamics

  • Premium features (larger context, multimodal) command higher prices
  • Volume discounts available for enterprise customers
  • Cached inputs reducing costs for repeated queries
  • Regional pricing variations emerging

Choosing the Right Model for Your Budget

Budget-Conscious
Low-Cost Applications

For high-volume, simple tasks where cost efficiency is paramount.

  • Basic text processing and classification
  • Simple Q&A systems
  • Content moderation
  • Data extraction from structured text

Balanced
General Purpose Applications

For most business applications requiring good quality and reasonable costs.

  • Customer support chatbots
  • Content generation and editing
  • Code assistance and debugging
  • Educational applications

Premium
High-Quality Applications

For critical applications where quality justifies higher costs.

  • Complex reasoning and analysis
  • Creative writing and content creation
  • Research and technical documentation
  • Multi-step problem solving

Estimating Your Costs

Use these guidelines to estimate your monthly AI costs based on usage patterns:

Light Usage
$10-50
  • ~100K tokens/month
  • Personal projects
  • Small applications
  • Testing and development
Moderate Usage
$100-500
  • ~1M tokens/month
  • Small business applications
  • Customer support bots
  • Content generation
Heavy Usage
$500+
  • 10M+ tokens/month
  • Enterprise applications
  • High-volume processing
  • 24/7 production systems

Real-World Cost Examples

Customer Support Chatbot

Business
$41.25/month

A moderate-volume customer support bot handling 1,000 conversations per day

Usage Pattern:
  • Avg Input: 150 tokens
  • Avg Output: 100 tokens
  • Volume: 1,000 interactions/day (30 days/month)
  • Model: GPT-4o ($2.50 input, $10.00 output per 1M tokens)
Cost Breakdown:

Monthly Input: 4.50M tokens × $2.50 = $11.25

Monthly Output: 3.00M tokens × $10.00 = $30.00

Total: $41.25

Content Generation Tool

Creative
$1.17/month

A content creation tool generating 50 blog posts per month

Usage Pattern:
  • Avg Input: 300 tokens
  • Avg Output: 1,500 tokens
  • Volume: 50 interactions/month
  • Model: Claude Sonnet ($3.00 input, $15.00 output per 1M tokens)
Cost Breakdown:

Monthly Input: 0.015M tokens × $3.00 = $0.045

Monthly Output: 0.075M tokens × $15.00 = $1.125

Total: $1.17

Code Assistant

Development
$0.86/month

A development team using AI for code review and suggestions

Usage Pattern:
  • Avg Input: 500 tokens
  • Avg Output: 200 tokens
  • Volume: 200 interactions/day (22 working days/month)
  • Model: GPT-4o mini ($0.15 input, $0.60 output per 1M tokens)
Cost Breakdown:

Monthly Input: 2.20M tokens × $0.15 = $0.33

Monthly Output: 0.88M tokens × $0.60 = $0.528

Total: $0.86
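
The three scenarios can be recomputed from the per-interaction token counts and the rates in the pricing table. The sketch below assumes 30 days per month for the chatbot, 50 posts per month for the content tool, and 22 working days per month for the code assistant, which is what the monthly token figures imply. The `Scenario` class is a hypothetical helper, not part of any provider SDK.

```python
from dataclasses import dataclass


@dataclass
class Scenario:
    name: str
    input_tokens: int    # average per interaction
    output_tokens: int   # average per interaction
    interactions: int    # per month
    input_rate: float    # USD per 1M tokens
    output_rate: float   # USD per 1M tokens

    def monthly_cost(self) -> float:
        tokens_in = self.input_tokens * self.interactions
        tokens_out = self.output_tokens * self.interactions
        return (tokens_in * self.input_rate
                + tokens_out * self.output_rate) / 1_000_000


scenarios = [
    Scenario("Customer Support Chatbot", 150, 100, 1_000 * 30, 2.50, 10.00),
    Scenario("Content Generation Tool", 300, 1_500, 50, 3.00, 15.00),
    Scenario("Code Assistant", 500, 200, 200 * 22, 0.15, 0.60),
]

for s in scenarios:
    print(f"{s.name}: ${s.monthly_cost():.2f}/month")
```

Swapping in another model's rates from the table shows how sensitive each scenario is to model choice.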

Best Models by Use Case

High Input, Low Output

Document analysis and summarization

Example: Analyzing 10-page documents → 2-paragraph summaries
Recommended Models:
Claude Haiku
Low input cost, efficient processing
GPT-4o mini
Best value for analysis tasks

Tip: Focus on models with low input pricing
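
One way to apply this tip is to price a representative request against several candidates and sort the results. The sketch below uses rates copied from the table; the 5,000-token document and 200-token summary are hypothetical sizes, and `rank_by_cost` is an illustrative helper.

```python
# (input rate, output rate) in USD per 1M tokens, copied from the table.
RATES = {
    "Claude Haiku 3.5": (0.80, 4.00),
    "GPT-4o mini": (0.15, 0.60),
    "GPT-4o": (2.50, 10.00),
    "Gemini 2.0 Flash": (0.10, 0.40),
}


def rank_by_cost(input_tokens: int, output_tokens: int) -> list[tuple[str, float]]:
    """Return (model, USD cost per request) pairs, cheapest first."""
    costs = {
        model: (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000
        for model, (in_rate, out_rate) in RATES.items()
    }
    return sorted(costs.items(), key=lambda item: item[1])


# Hypothetical workload: a 5,000-token document summarized in 200 tokens.
for model, cost in rank_by_cost(5_000, 200):
    print(f"{model}: ${cost:.5f}")
```

Cost is only one axis; the ranking says nothing about summary quality, which still has to be checked on your own documents.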

Low Input, High Output

Creative writing and content generation

Example: Short prompts → Long-form articles
Recommended Models:
GPT-4o
Good output quality, reasonable output pricing
Gemini Pro
Competitive output rates

Tip: Prioritize models with good output pricing and quality

Balanced Usage

Interactive conversations and Q&A

Example: Chat applications with moderate exchanges
Recommended Models:
Claude Sonnet
Balanced pricing, good conversation quality
GPT-4o
Reliable performance across use cases

Tip: Consider overall cost per conversation

Monthly Budget Planning Guide

Monthly Budget:
$50

Recommended Strategy:
  • GPT-4o mini for most tasks (~300K tokens)
  • Claude Haiku for document processing
  • Perfect for personal projects and prototyping
What You Can Achieve:
  • 50 detailed conversations
  • 100 code reviews
  • 25 document summaries

Monthly Budget:
$200

Recommended Strategy:
  • Mix of GPT-4o and GPT-4o mini (~100K quality tokens)
  • Claude Sonnet for complex reasoning
  • Good for small business applications
What You Can Achieve:
  • 500 customer interactions
  • 100 content pieces
  • 1,000 code assists

Monthly Budget:
$1,000

Recommended Strategy:
  • GPT-4o for critical tasks (~100K tokens)
  • Claude Sonnet for reasoning (~200K tokens)
  • GPT-4o mini for high-volume tasks (~5M tokens)
What You Can Achieve:
  • 5,000 support tickets
  • 200 articles
  • 10,000 code reviews

Advanced Cost Optimization Tips

Cost-Saving Strategies

  • Model Switching: Use cheaper models for simple tasks, premium for complex ones
  • Batch Processing: Group similar requests to reduce overhead
  • Caching: Store and reuse responses for common queries
  • Prompt Engineering: Optimize prompts to reduce token usage
  • Output Limits: Set max_tokens to control response length
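
Model switching and caching can be combined in a thin wrapper around whatever client you use. The sketch below is one possible shape under stated assumptions: `pick_model`, `CachedClient`, and the `fake_call` stub are hypothetical names, the "complex task" flag is supplied by the caller, and the cache is a plain in-memory dictionary keyed on model and prompt.

```python
from typing import Callable, Dict, Tuple


def pick_model(complex_task: bool) -> str:
    """Model switching: cheap model by default, premium only when flagged."""
    return "GPT-4o" if complex_task else "GPT-4o mini"


class CachedClient:
    """Caching: reuse stored responses for repeated (model, prompt) pairs."""

    def __init__(self, call_model: Callable[[str, str], str]) -> None:
        self._call_model = call_model
        self._cache: Dict[Tuple[str, str], str] = {}
        self.api_calls = 0  # number of billable calls actually made

    def ask(self, prompt: str, complex_task: bool = False) -> str:
        key = (pick_model(complex_task), prompt)
        if key not in self._cache:
            self.api_calls += 1
            self._cache[key] = self._call_model(*key)
        return self._cache[key]


def fake_call(model: str, prompt: str) -> str:
    """Stand-in for a real API call so the sketch runs offline."""
    return f"[{model}] answer to: {prompt}"


client = CachedClient(fake_call)
client.ask("What is your refund policy?")
client.ask("What is your refund policy?")  # served from the cache
print(client.api_calls)  # 1
```

Exact-match caching only helps when identical prompts recur; a production setup would also need an expiry policy so stale answers are not served indefinitely.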

Monitoring and Optimization

  • Usage Analytics: Track token consumption patterns
  • A/B Testing: Compare model performance vs cost
  • Regular Reviews: Reassess model choices quarterly
  • Alert Systems: Set up budget alerts for cost control
  • ROI Analysis: Measure value generated per dollar spent
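
A budget alert can be as simple as a running total compared against a threshold. The sketch below is a hypothetical `BudgetTracker` that flags when 80% of a monthly budget has been spent; the threshold and the token volumes are illustrative, and the rates are GPT-4o's from the table.

```python
class BudgetTracker:
    """Accumulates spend and flags when a share of the monthly budget is used."""

    def __init__(self, monthly_budget: float, alert_fraction: float = 0.8) -> None:
        self.monthly_budget = monthly_budget
        self.alert_threshold = monthly_budget * alert_fraction
        self.spent = 0.0

    def record(self, input_tokens: int, output_tokens: int,
               input_rate: float, output_rate: float) -> bool:
        """Add one batch of usage; return True once the alert threshold is reached."""
        self.spent += (input_tokens * input_rate
                       + output_tokens * output_rate) / 1_000_000
        return self.spent >= self.alert_threshold


tracker = BudgetTracker(monthly_budget=50.0)
first = tracker.record(1_000_000, 1_000_000, 2.50, 10.00)   # $12.50 spent so far
second = tracker.record(1_000_000, 3_000_000, 2.50, 10.00)  # $45.00 spent so far
print(first, second)  # False True
```

In practice the token counts would come from the usage data returned with each API response, and most providers also offer spending limits in their own dashboards.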

Industry-Specific Cost Examples

E-commerce

  • Product descriptions: $200-500/month
  • Customer service: $300-800/month
  • Review analysis: $100-300/month

Focus on Claude Haiku for descriptions, GPT-4o mini for support

Software Development

  • Code generation: $400-1000/month
  • Documentation: $200-500/month
  • Code review: $100-400/month

Mix GPT-4o for complex logic, GPT-4o mini for routine tasks

Content & Marketing

  • Blog writing: $500-1500/month
  • Social media: $200-600/month
  • Ad copy: $300-800/month

Claude Sonnet for long-form, GPT-4o for creative campaigns