GEPA token rates are charged per million tokens consumed during prompt optimization.
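
As a rough illustration of per-million-token billing, the sketch below estimates a charge from token counts and a pair of rates. The function name `estimate_cost` and the token counts are hypothetical; the example rates ($0.25 input, $2.00 output) are the GPT-5 Mini figures from this page. It assumes input and output tokens are billed independently with no rounding or minimum charge, which this page does not confirm.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, input_rate, output_rate):
    """Estimate a charge in USD from token counts and per-1M-token rates.

    Hypothetical helper: assumes input and output tokens are billed
    separately and linearly, with no rounding or minimum charge.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * Decimal(input_rate)
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * Decimal(output_rate)
    return input_cost + output_cost


# 1.2M input tokens and 300k output tokens at the GPT-5 Mini rates.
cost = estimate_cost(1_200_000, 300_000, "0.25", "2.00")
print(f"${cost:.2f}")  # prints $0.90
```

Rates are passed as strings and converted to `Decimal` so that currency arithmetic does not pick up binary floating-point error.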

OpenAI Models

Model           Input (per 1M tokens)   Output (per 1M tokens)
GPT-5.1         $1.25                   $10.00
GPT-5.1 Mini    $0.25                   $2.00
GPT-5.1 Nano    $0.05                   $0.40
GPT-5           $1.25                   $10.00
GPT-5 Mini      $0.25                   $2.00
GPT-5 Nano      $0.05                   $0.40
GPT-4.1         $2.00                   $8.00
GPT-4.1 Mini    $0.40                   $1.60
GPT-4.1 Nano    $0.10                   $0.40
GPT-4o          $2.50                   $10.00
GPT-4o Mini     $0.15                   $0.60
Note: gpt-5-pro is explicitly rejected as too expensive ($15/$15/$120 per 1M tokens).
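
A client could mirror this rejection locally before submitting a job. The following is a minimal sketch, not the service's actual validation: `REJECTED_MODELS` and `validate_model` are hypothetical names, and trimming and lowercasing the identifier is an assumption about how model names are compared.

```python
# Hypothetical client-side guard; the service's real check may differ.
REJECTED_MODELS = {"gpt-5-pro"}


def validate_model(model: str) -> str:
    """Return the normalized model name, or raise if it is rejected."""
    # Assumption: identifiers are compared case-insensitively after trimming.
    normalized = model.strip().lower()
    if normalized in REJECTED_MODELS:
        raise ValueError(f"{normalized} is rejected: too expensive for optimization")
    return normalized
```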

Groq Models

Model                           Input (per 1M tokens)   Output (per 1M tokens)
openai/gpt-oss-20b              $0.075                  $0.30
openai/gpt-oss-120b             $0.150                  $0.60
moonshotai/kimi-k2-0905         $1.00                   $3.00
meta/llama-guard-4-12b          $0.20                   $0.20
qwen/qwen3-32b                  $0.29                   $0.59
meta/llama-3.3-70b-versatile    $0.59                   $0.79
meta/llama-3.1-8b-instant       $0.05                   $0.08

Google Models

Model                          Input (per 1M tokens)   Output (per 1M tokens)
gemini-3-pro-preview           $2.00                   $12.00
gemini-3-pro-preview-gt200k    $4.00                   $18.00
gemini-2.5-pro                 $1.25                   $10.00
gemini-2.5-pro-gt200k          $2.50                   $15.00
gemini-2.5-flash               $0.30                   $2.50
gemini-2.5-flash-lite          $0.10                   $0.40
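
The `-gt200k` rows appear to be a higher-priced tier for long prompts. The sketch below assumes that tier applies when a single prompt exceeds 200,000 input tokens; this page does not state the threshold or how it is measured, so treat `LONG_CONTEXT_THRESHOLD`, `rate_key`, and `request_cost` as hypothetical. The rates are copied from the Google Models table.

```python
from decimal import Decimal

# (input, output) rates in USD per 1M tokens, from the Google Models table.
GOOGLE_RATES = {
    "gemini-3-pro-preview": (Decimal("2.00"), Decimal("12.00")),
    "gemini-3-pro-preview-gt200k": (Decimal("4.00"), Decimal("18.00")),
    "gemini-2.5-pro": (Decimal("1.25"), Decimal("10.00")),
    "gemini-2.5-pro-gt200k": (Decimal("2.50"), Decimal("15.00")),
    "gemini-2.5-flash": (Decimal("0.30"), Decimal("2.50")),
    "gemini-2.5-flash-lite": (Decimal("0.10"), Decimal("0.40")),
}

# Assumption: "gt200k" means the prompt is longer than 200,000 tokens.
LONG_CONTEXT_THRESHOLD = 200_000
MILLION = Decimal(1_000_000)


def rate_key(model, prompt_tokens):
    """Pick the table row to bill against for a given prompt size."""
    long_key = f"{model}-gt200k"
    if prompt_tokens > LONG_CONTEXT_THRESHOLD and long_key in GOOGLE_RATES:
        return long_key
    return model


def request_cost(model, prompt_tokens, output_tokens):
    """Estimate the USD cost of one request under the assumed tier rule."""
    input_rate, output_rate = GOOGLE_RATES[rate_key(model, prompt_tokens)]
    return (Decimal(prompt_tokens) / MILLION * input_rate
            + Decimal(output_tokens) / MILLION * output_rate)


print(request_cost("gemini-2.5-pro", 100_000, 10_000))  # standard tier
print(request_cost("gemini-2.5-pro", 300_000, 10_000))  # long-context tier
```

Models without a `-gt200k` row, such as gemini-2.5-flash, fall back to their single listed rate regardless of prompt length.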