How much does AI translation cost? DeepL, Google Translate, OpenAI compared (2026)

Kinga Pomykała
Kinga Pomykała
Last updated: June 03, 202612 min read
How much does AI translation cost? DeepL, Google Translate, OpenAI compared (2026)

Translation budget conversations usually start with the wrong question. Teams ask "which provider is best" before they know what "best" means for their project. Cost per character, cost per token, quality for European languages versus Asian languages, context awareness, privacy compliance: these are different trade-offs, and the answer depends on your use case.

This post gives you the actual numbers for 2026 across the providers teams most commonly use, a framework for comparing them fairly, and practical guidance on where each one earns its price.

Who this is for: developers and product teams deciding how to budget AI or machine translation into a localization workflow.

For a broader overview of how AI and machine translation fits into a localization strategy, start with the AI and machine translation guide. If you want the full picture on quality differences with examples, see our DeepL, Google Translate, and OpenAI auto-translation comparison.

Why translation pricing is confusing

The core problem is that providers use different billing units:

  • Traditional MT APIs (DeepL, Google Translate) bill per character
  • LLM-based models (OpenAI, Gemini, Claude, DeepSeek) bill per token, split into input and output

English averages roughly 4 characters per token. CJK scripts (Japanese, Chinese) are more token-efficient; German compound words run slightly higher. For a rough estimate: 1,000 characters ≈ 250 input tokens + 250 output tokens.

Input vs output. Output tokens cost 3-5x more than input. Everything in your prompt: source text, system instructions, key descriptions, context, is billed as input. It seems small per call but compounds across thousands of keys and many languages. Batching multiple strings per request keeps it manageable.

Reasoning tokens. Models like OpenAI o-series or Gemini 2.5 Pro with thinking enabled run an internal chain-of-thought that can be billed separately or included into the output cost, and is not visible in the output. For standard UI string translation, avoid them; they add cost and latency without meaningful quality gains.

Practical note: most real software localization projects have far fewer than 1 million characters to translate. A mid-sized SaaS app might have 50,000-200,000 characters across all supported locales, counting only new or changed strings per sprint. Smart caching (only sending changed keys to the translation API) is often the single biggest cost lever available.

Traditional machine translation APIs

These providers use specialized neural MT engines built for translation. They are fast, predictable, and billed purely on character volume.

Google Cloud Translation

Pricing:

  • 500,000 characters/month free (resets monthly, never expires)
  • Basic NMT: $20 per million characters
  • LLM Translation mode: $10 input + $10 output per million characters (no free tier)
  • Adaptive Translation (fine-tuned on your data): $25 + $25 per million characters

Language support: 249 languages as of 2026: the broadest coverage of any major provider. In late 2025, Google upgraded its models with Gemini-powered quality improvements, particularly on idioms and conversational content.

Best for: teams that need to cover a wide range of languages fast, especially non-European ones (Arabic, Southeast Asian scripts, rare language pairs). The standard NMT is well-priced at $20/M, and the free tier is generous enough for development and staging.

Watch out for: billing counts HTML tags as characters when present in your strings. If you are sending rich-text content through the API, strip markup first or your actual cost will be noticeably higher than estimated.

DeepL API

Pricing:

  • DeepL API Developer plan: 1 million characters as one-time credit for testing, no monthly fee
  • DeepL API Growth plan: 1 million characters/month for $32.50/month (or €29.75/mo in Europe)
    • usage limit is 50 million characters per month
    • $27.50 for 1,000,000 extra characters

API Free and API Pro plans have been discontinued for new signups as of mid-2026, but existing users can continue on their current plan.

Language support: 30 available languages, plus concentrated in European pairs. DeepL consistently produces more natural-sounding output for English-German, English-French, English-Spanish, and other European pairs, mainly for longer or more complex strings. DeepL's next-gen LLM adds over 80 new language pairs in 2026.

Best for: European SaaS products with customer-facing copy where natural tone matters. Marketing pages, onboarding flows, and UI copy that users read carefully benefit from DeepL's quality advantage in supported pairs.

Watch out for: The cost is higher than Google NMT for the same character volume, and the free tier is more limited. For non-European languages, the quality gap narrows, so the price premium may not be justified.

Source: DeepL Translator languages, DeepL API pricing accessed June 2026.

Quick comparison

Here is a quick cost comparison for 1 million characters of input text (assuming output length is roughly the same):

ProviderCost per 1M charsFree tierLanguages
Google NMT$20500K/mo249
DeepL API Growth$32.501M100+
Microsoft Translator$102M/mo100+

Microsoft Translator at $10/M and a 2M character permanent free tier is worth knowing about, particularly for teams that need broad coverage at lower cost and are not yet at the scale where DeepL's quality premium pays off.

LLM-based translation via API

LLMs are not translation-specialized engines. They are general-purpose language models that, given the right prompt and context, produce high-quality translations, particularly for nuanced, tone-sensitive, or ambiguous content. The cost structure and capabilities are different enough to treat separately.

How to convert LLM pricing to translation cost

LLMs bill per token. For translation, you have:

  • Input tokens: your system prompt + the source text + any context (key descriptions, project description, character limits, tags)
  • Output tokens: the translated text (+ any reasoning tokens if you enable thinking)

Output tokens typically cost 3-5x more than input tokens. For translation, a rough rule: output tokens ≈ input source-text tokens (output length roughly matches source length). The system prompt and context add overhead; typically 200-500 tokens per request, or much less if you batch many strings in one call.

For budget estimation, a useful approximation:
1,000 characters of source text ≈ 250 input tokens + 250 output tokens

At most mid-tier LLM prices, that makes AI translation comparable to or cheaper than character-based MT, if you batch efficiently.

OpenAI

OpenAI's models are the most widely used for LLM-based translation, with a range of options from budget to premium quality.

ModelInput (1M tokens)Output (1M tokens)Notes
GPT-4o-mini$0.15$0.60Best value for bulk UI strings
GPT-4o$2.50$10.00Strong general quality
GPT-4.1$2.00$8.00Good balance of cost and capability
Batch API (any model)50% off50% offAsync, 24h turnaround

GPT-4o-mini is the go-to for high-volume localization where cost is the primary concern. Quality is solid for most UI strings. At $0.15/M input and $0.60/M output, it competes directly with character-based MT on price while adding context awareness.

GPT-4o earns its price for marketing copy, onboarding flows, and content where tone matters. The context window (128K tokens) lets you batch many strings per request, which reduces per-key overhead from the system prompt.

Batch API is worth using for non-time-sensitive localization runs, translating all missing keys overnight at half the cost.

Google Gemini

Google's Gemini models are a strong alternative to OpenAI for translation, with competitive pricing and good quality across a wide range of languages.

ModelInput (1M tokens)Output (1M tokens)Notes
Gemini 2.5 Flash-Lite$0.10$0.40Cheapest capable model
Gemini 2.5 Flash$0.30$2.50Strong throughput
Gemini 2.5 Pro$1.25$10.00High reasoning quality

Gemini 2.5 Flash is a strong choice for bulk translation where speed and cost matter more than peak nuance. It handles multilingual content well and is often the most cost-effective option for large batches. Gemini also offers a generous free tier via AI Studio: 15 requests per minute at no cost on Flash models, which covers development and prototyping.

Anthropic Claude

ModelInput (1M tokens)Output (1M tokens)Notes
Claude Haiku 4.5$1.00$5.00Fast, budget-friendly
Claude Sonnet 4.6$3.00$15.00Nuanced output, strong for creative copy

Claude's particular strength in translation is tone and cultural nuance. For marketing pages, voice-heavy onboarding, and brand-sensitive copy, Claude Sonnet produces output that reads more naturally to native speakers. It is not the cheapest option for bulk UI strings, but the quality gap is visible in customer-facing content.

DeepSeek V3

DeepSeek is a newer player in the LLM space, with a focus on cost-effective translation. Their V3.2 model is optimized for translation tasks and offers competitive pricing.

ModelInput (1M tokens)Output (1M tokens)Notes
DeepSeek V3.2~$0.22~$0.34Remarkably cheap for the quality

DeepSeek V3 has emerged as a budget-first option for teams that want LLM-quality output at near-MT prices. Particularly strong for East Asian languages (Mandarin, Japanese, Korean), which makes it interesting for teams targeting those markets where Google and DeepL are less differentiated. Available via OpenRouter.

OpenRouter: accessing multiple models via one API

OpenRouter is an API gateway that provides access to models from multiple providers like Claude, GPT, Gemini, DeepSeek, Llama, Mistral, through a single endpoint. It charges provider list prices plus a small markup.

The main practical use case for localization is mixing models per task: cheaper, faster models for bulk UI string runs and higher-quality models for tone-sensitive copy, without maintaining separate API keys and billing accounts per provider.

Real cost scenarios

These estimates assume efficient batching, with system prompt overhead amortized across many strings per request.

Scenario: 10,000 translation keys, English source, translating into 5 languages (DE, FR, ES, PT, JA)

Assume average key length of 60 characters = 600,000 characters of source text × 5 languages = 3M total characters to translate.

ProviderEst. costNotes
Google NMT~$60After free tier
DeepL API Growth~$88Strong for DE/FR/ES/PT; limited JA quality
GPT-4o-mini~$10-15Context-aware, all 5 languages
Gemini 2.5 Flash~$15-25Fast, cost-efficient
Claude Haiku 4.5~$30-45Good quality/cost balance
GPT-4o~$65-100Premium quality

Key insight: GPT-4o-mini at $0.15/M input often undercuts traditional MT services in absolute cost, while adding context awareness and support for custom instructions. The per-character mental model from traditional MT underestimates how competitive LLMs have become at the budget tier.

Scenario: Only new strings (smart caching in action)

A product with 10,000 keys that ships weekly and adds 150 new keys per sprint:

  • Without caching: paying to re-translate all 10,000 keys every run
  • With caching: paying to translate only 150 new keys per run

At $0.15/M input tokens (GPT-4o-mini), 150 keys at 60 chars average = 9,000 characters ≈ 2,250 input tokens + 2,250 output tokens. Total per sprint: under $0.01.

This is why translation memory and smart caching: sending only new or changed keys to the API, is more impactful than provider choice for most teams.

How to choose

Use this as a starting framework:

Use Google NMT if:

  • You need broad language coverage (100+ languages, especially Asian and African scripts)
  • You want predictable character-based pricing with a permanent free tier
  • You need fast turnaround for large volumes

Use DeepL if:

  • Your target languages are primarily European (DE, FR, ES, IT, NL, PL, etc.)
  • Customer-facing copy quality matters
  • You are willing to pay a small premium for better natural-language output in supported pairs

Use GPT-4o-mini or Gemini 2.5 Flash if:

  • You want LLM context awareness at near-MT prices
  • You have varied content types (UI strings, help docs, marketing copy) in one workflow
  • You want one provider that covers all your languages without quality gaps

Use Claude Sonnet or GPT-4o if:

  • The content is brand-sensitive, marketing-heavy, or voice-driven
  • Tone accuracy justifies a higher per-token cost
  • You want the best output quality with no manual post-editing

Use DeepSeek V3 via OpenRouter if:

  • East Asian languages are a priority
  • Cost is a hard constraint and you need LLM-quality output
  • You are comfortable with a less established provider

One more cost lever: your TMS

Provider pricing is only one part of the equation. How your translation management system sends content to the API affects cost at least as much.

A tool that re-translates your entire file every run costs 100x more than one that tracks which keys changed and sends only the delta. SimpleLocalize tracks translation state per key and only sends new or modified strings to the AI model. Combined with per-project context settings, the effective cost per language per sprint is usually a few dollars for most SaaS teams.

For a full walkthrough of how AI translation workflows fit into a broader localization pipeline, see our guide on AI-powered localization workflows and the AI-Powered Localization control center page.

Summary

Here is a quick cost comparison for 1 million characters of input text (assuming output length is roughly the same):

ProviderBillingCost at 1M chars/tokens inputStrength
Google NMTper character$20Breadth, 249 languages
DeepL API Growthper character$27.50/moEuropean language quality
GPT-4o-miniper token~$3-5 equiv.Cost-effective LLM quality
Gemini 2.5 Flashper token~$5-8 equiv.Speed and throughput
Claude Haiku 4.5per token~$20-30 equiv.Balanced quality
GPT-4oper token~$50-80 equiv.Premium general quality
Claude Sonnet 4.6per token~$60-100 equiv.Nuanced, tone-accurate output
DeepSeek V3per token~$3-5 equiv.Budget LLM, strong on Asian pairs

Character equivalents are rough estimates assuming 250 input + 250 output tokens per 1,000 source characters, with batched system prompts.

Pricing changes regularly, so always verify current rates before locking in a budget. Links to official pricing pages: OpenAI, Google Cloud Translation, DeepL API, OpenRouter.

Kinga Pomykała
Kinga Pomykała
Content creator of SimpleLocalize

Get started with SimpleLocalize

  • All-in-one localization platform
  • Web-based translation editor for your team
  • Auto-translation, QA-checks, AI and more
  • See how easily you can start localizing your product.
  • Powerful API, hosting, integrations and developer tools
  • Unmatched customer support
Start for free
No credit card required5-minute setup
"The product
and support
are fantastic."
Laars Buur|CTO
"The support is
blazing fast,
thank you Jakub!"
Stefan|Developer
"Interface that
makes any dev
feel at home!"
Dario De Cianni|CTO
"Excellent app,
saves my time
and money"
Dmitry Melnik|Developer