Choosing an AI Provider

Batch Translate with AI works with any OpenAI-compatible API. Nine providers are built in — you just pick one, get an API key, and paste it in. No other configuration needed.

Quick recommendation

If you want…	Use
Cheapest option for bulk translation	DeepSeek
Best overall quality	OpenAI (gpt-4o or gpt-4o-mini)
Cheapest multi-language runs	OpenAI or DeepSeek (prompt caching)
Fastest responses	Groq
No account, many models	OpenRouter
European data residency	Mistral AI

Multi-language efficiency (prompt caching)

When you translate one document into several languages, the source text has to reach the model for each language. Providers with prompt caching reuse the source from cache for languages 2…N at a large discount instead of charging full price every time.

In the app, caching-capable providers are labelled “Cost-efficient for multi-language.” The translation engine still works on any provider — it just won’t get the discount on ones without caching, and the settings dialog tells you which is which.

Caching support	Providers
Yes (cost-efficient for multi-language)	OpenAI, DeepSeek, OpenRouter*
No (still works, no multi-language discount)	Groq, Mistral, Together AI, xAI, Fireworks, Cerebras

* OpenRouter caching depends on the underlying model you route to.

All supported providers

Provider	Strengths	Free tier	Sign up
OpenAI	Best quality, reliable, prompt caching	No (credits on signup)	platform.openai.com
DeepSeek	Extremely low cost, strong quality, context caching	Yes (limited)	platform.deepseek.com
OpenRouter	100+ models from one key, caching varies by model	No	openrouter.ai
Groq	Very fast inference, good for high volume	Yes (rate limited)	console.groq.com
Mistral AI	Strong multilingual, EU-based	Yes (limited)	console.mistral.ai
Together AI	Wide model selection, competitive pricing	Yes ($1 credit)	api.together.ai
xAI (Grok)	Grok models, good reasoning	No	console.x.ai
Fireworks AI	Fast, cost-effective	Yes ($1 credit)	fireworks.ai
Cerebras	Ultra-fast inference	Yes (limited)	cloud.cerebras.ai

Cost considerations

All providers charge per token — roughly 1 token per 0.75 words. Translation output is usually a little longer than the source, and some languages (German, Finnish) use more tokens per character than English.

A 10-page document into one language uses very roughly 8,000–20,000 tokens total
A 300-page book into one language can run into the low millions of tokens
At DeepSeek’s rates a large book typically costs a few dollars; premium models cost more
Translating into N languages with a caching-capable provider costs far less than N× the single-language cost

Use Estimate size before large jobs, and watch the live projection during a run. The app deliberately does not display prices — multiply the token figure by your provider’s current published rate.

Using a custom endpoint

If you use a self-hosted model (Ollama, LM Studio, LocalAI) or a provider not in the list, select Custom (OpenAI-Compatible) from the provider dropdown and enter your endpoint’s base URL manually. The endpoint must be OpenAI-compatible (/chat/completions). See Finding Your Base URL.

Switching providers

You can switch providers at any time from the key icon. Base URL, model, and the caching flag update automatically from the preset; your API key is stored securely on device.