Last updated: March 31, 2026 Maintained by the CheapAI team Verify pricing → Raw data (JSON)

AI Models

All models available through CheapAI — at up to 65% below official API pricing. Click any card for detailed specs, use-cases, and setup guidance.

See Pricing API Docs

All Models

Supported model catalog

Every model below is available through CheapAI's OpenAI-compatible API proxy. You connect once; switch models by changing a single parameter.

Anthropic — Claude

Anthropic Claude Claude Opus 4.6

Anthropic's most capable model. Exceptional at complex reasoning, multi-step coding, research synthesis, and long-context analysis up to 1M tokens. Best choice for demanding agentic workflows.

Input: $5.00/1M tokens Output: $25.00/1M tokens Context: 200K–1M Thinking: ✓
Cursor AI Claude Code LangChain Complex reasoning
Anthropic Claude Sonnet 4.6

The ideal balance of capability and speed. Handles most coding, writing, and analysis tasks with near-Opus quality at significantly lower cost. Supports extended thinking and 1M context.

Input: $3.00/1M tokens Output: $15.00/1M tokens Context: 200K–1M Thinking: ✓
Cursor AI Best value Agentic
Anthropic Claude Opus 4.6

Previous generation Opus for high-demand workflows. Excellent reasoning at a proven-capability level. 200K context window. No thinking mode.

Input: $5.00/1M tokens Output: $25.00/1M tokens Context: 200K
Coding Analysis
Anthropic Claude Haiku 4.5

Fastest Claude model. Built for low-latency applications, classification, summarisation, and high-volume tasks where speed matters more than maximal capability.

Input: $1.00/1M tokens Output: $5.00/1M tokens Context: 200K
Fast High-volume Low latency

OpenAI — GPT

OpenAI GPT-5.4

OpenAI's flagship multimodal model. Handles complex reasoning, code generation, image analysis, and long-context tasks. The most capable GPT-5.3 Codex variant, optimized for production use.

Input: $2.50/1M tokens Output: $15.00/1M tokens
Cursor AI Multimodal Coding
OpenAI GPT-5.3 Codex Codex

Coding-optimized GPT-5.3 Codex variant. Superior at code generation, refactoring, test writing, and agentic development tasks. Ideal for Cursor AI and Claude Code-style workflows.

Input: $1.75/1M tokens Output: $14.00/1M tokens
Code-optimized Cursor AI
OpenAI GPT-5.2 Codex

Balanced capability and cost. Strong at code, writing, and analysis. Available with $90/day or unlimited spend limits — ideal for sustained development workflows at controlled cost.

Input: $1.75/1M tokens Output: $14.00/1M tokens
Best value GPT Coding

Google — Gemini

Google Gemini 3 Pro

Google's most advanced Gemini model. Outstanding at multimodal reasoning, long-document analysis, and tasks requiring broad world knowledge. Native multimodal from ground up.

Input: $2.00/1M tokens Output: $12.00/1M tokens
Multimodal Long context
Google Gemini 3 Flash

Fast, cost-efficient Gemini model for high-throughput pipelines. Excellent at classification, summarisation, structured output, and multi-turn conversation at scale.

Input: $0.50/1M tokens Output: $3.00/1M tokens
Fast High-volume Budget

DeepSeek

DeepSeek DeepSeek V3.2

State-of-the-art open-source model with frontier-level coding and math performance at a fraction of the price. Excellent for developers running high-volume code generation, data analysis, or research pipelines.

Input: $0.28/1M tokens Output: $0.42/1M tokens
Ultra-low cost Coding Math

Interactive directory