Complete AI API Pricing Database: 15 Models Compared (May 2026)

Published May 27, 2026 · API Benchmarks

This is the most complete AI API pricing comparison available as of May 2026. All prices verified against official provider documentation and independent testing. Prices are per million tokens in USD.

Complete Pricing Table

ModelInput $/MOutput $/MMax Contexttok/sEfficiency
Qwen3-8B$0.01$0.0132K15615,600
GLM-4-9B$0.01$0.01128K11011,000
Step-3.5-Flash$0.08$0.1532K1601,066
Qwen3.5-27B$0.12$0.19128K95500
DeepSeek V4 Flash$0.18$0.25128K142568
Qwen3-32B$0.18$0.28128K128457
Qwen-MT-Turbo$0.18$0.3032K90300
Qwen3-Coder-30B$0.20$0.35128K105300
DeepSeek V3.2$0.32$0.38128K78205
Hunyuan-Turbo$0.35$0.5732K118207
GLM-4-32B$0.49$0.56128K72128
DeepSeek V4 Pro$0.52$0.75128K5573
GLM-5$0.73$1.92128K4825
Kimi K2.5$0.59$3.00128K5217
DeepSeek-R1$1.20$2.5064K3514

Cost-Efficiency Ranking

The efficiency score is tokens-per-second divided by output price. Higher = more speed for your dollar. Qwen3-8B dominates because it's both the fastest and cheapest. DeepSeek V4 Flash is the efficiency champion among mid-tier models.

All models accessible via Global API — unified endpoint, PayPal billing.

Also Read on Our Network