13 models across 3 providers. Use smart routing or pick a specific model.
Don't know which model to use? Let us pick for you. Just set model="auto" and we handle the rest.
autoRecommendedWe analyze your request and pick the cheapest model that can handle it. Best for most use cases.
Best for: Everyone — save money automatically
fastCheapestAlways uses the fastest, cheapest model. Great for simple questions, chat, and quick lookups.
Best for: Simple tasks, high volume
balancedValueA good balance of cost and quality. Handles most tasks well without premium pricing.
Best for: General-purpose work
bestPremiumAlways uses the most capable model. For tasks that need the highest quality output.
Best for: Complex coding, research, analysis
Specify a model by name to bypass smart routing. Your request goes directly to that provider.
GPT-4o
Flagshipgpt-4oComplex tasks, coding, detailed analysis
Context: 128K tokens
GPT-4o Mini
Fastgpt-4o-miniQuick answers, simple tasks, chat
Context: 128K tokens
GPT-4 Turbo
Flagshipgpt-4-turboLong documents, detailed work
Context: 128K tokens
GPT-4
Legacygpt-4Reliable general-purpose tasks
Context: 8K tokens
GPT-3.5 Turbo
Legacygpt-3.5-turboBasic tasks, high volume
Context: 16K tokens
o1
Reasoningo1Deep reasoning, math, logic problems
Context: 200K tokens
o3-mini
Reasoningo3-miniReasoning tasks, smaller and faster
Context: 200K tokens
Claude Sonnet 4.5
Flagshipclaude-sonnet-4-5Nuanced writing, analysis, coding
Context: 200K tokens
Claude Haiku 4.5
Fastclaude-haiku-4-5Quick responses, summaries
Context: 200K tokens
Claude Opus 4
Flagshipclaude-opus-4Most capable Claude, complex research
Context: 200K tokens
Claude 3 Haiku
Legacyclaude-3-haikuBasic tasks, previous generation
Context: 200K tokens
Gemini 2.5 Flash
Fastgemini-2.5-flashLarge documents, fast processing
Context: 1M tokens
Gemini 2.5 Pro
Flagshipgemini-2.5-proComplex tasks, massive context window
Context: 1M tokens
Not sure which model to choose?
Use auto and let Thermly handle it. We'll analyze each request and route it to the most cost-effective model that can deliver great results.