Mistral API Pricing & Cost Calculator (2026)

Current Mistral API pricing per million tokens — Medium, Small and the open-weight family — with a live cost calculator, worked examples, EU-jurisdiction notes and savings tips.

Updated 2026-06-17 · prices auto-refreshed

Mistral is a leading European LLM provider and a favourite where data residency and open weights matter. Its catalogue mixes managed-API models with openly released ones, and many of its models price input and output closer together than the big US frontier labs do, which can make them economical for output-heavy work. This page covers the lineup, a worked example, why teams pick Mistral and how to keep the bill down. The pricing table further down comes from a live feed; the cost calculator estimates spend for your own token volumes.

The Mistral lineup

Mistral's range, billed per million input and output tokens, spans:

Mistral Medium — the balanced workhorse, competitive with mid-tier models from larger labs, usually at a lower price.
Mistral Small — fast and inexpensive, for high-volume, simpler tasks.
Open-weight family (Ministral and others) — compact models you can run via the API or self-host.
Specialist models — including coding-focused options for code generation and completion.

The pricing table further down shows the live spread. A Batch tier offers roughly a 50% discount for asynchronous bulk jobs.

Why teams choose Mistral

European jurisdiction. For organisations with EU data-governance requirements, a European provider materially simplifies compliance and procurement.
Open weights. Several Mistral models are openly available, so you can prototype on the managed API and later self-host the same model for full data residency — without re-engineering your prompts. See our self-hosted inference review.
Output-friendly pricing. Because input and output rates are closer together than at some US labs, response-heavy workloads (chat, generation) can come out cheaper than the headline input price suggests.
Solid mid-tier price/quality. Mistral Medium is a strong default for general production work that doesn't need the absolute frontier.

A worked example

A chat product averages 1,200 input tokens and 800 output tokens per turn, 80,000 turns a month, on Mistral Medium:

cost_per_turn = (1,200/1e6 × input_price) + (800/1e6 × output_price)
monthly       = cost_per_turn × 80,000

Here input_price and output_price are the model's rates in dollars per million tokens.

Because output (800 tokens) is a large share of each turn, the output rate drives most of the bill. Mistral's relatively low output price helps here; a provider with a high output multiplier would cost more on the same traffic. Compare the same scenario against OpenAI and DeepSeek in the calculator to see where Mistral lands for your input/output ratio.
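The formula can be run as a short Python sketch. The rates below are copied from the Mistral Medium 3.1 row of this page's pricing table and are only a snapshot; the function and variable names are illustrative, not part of any Mistral SDK.

```python
def cost_per_turn(input_tokens, output_tokens, input_price, output_price):
    """Cost in USD of one turn; prices are USD per million tokens."""
    return input_tokens / 1e6 * input_price + output_tokens / 1e6 * output_price


# ASSUMPTION: snapshot rates from the Mistral Medium 3.1 table row; check current pricing.
INPUT_PRICE = 0.40   # USD per million input tokens
OUTPUT_PRICE = 2.00  # USD per million output tokens

per_turn = cost_per_turn(1_200, 800, INPUT_PRICE, OUTPUT_PRICE)
monthly = per_turn * 80_000

# Share of each turn's cost that comes from output tokens.
output_share = (800 / 1e6 * OUTPUT_PRICE) / per_turn

print(f"per turn: ${per_turn:.5f}")
print(f"monthly:  ${monthly:.2f}")
print(f"output share of cost: {output_share:.0%}")
```

At those snapshot rates the scenario comes to about $0.00208 per turn and about $166 a month, with output tokens making up roughly three quarters of the cost.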

How to cut your Mistral bill

Default to Small for simple, high-volume tasks; use Medium where quality matters.
Cap output — even with friendlier output pricing, length still drives cost.
Batch offline jobs for the asynchronous discount.
Consider self-hosting an open-weight Mistral model once volume is high and steady, for fixed cost and full data residency.
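To get a feel for how much each of these levers is worth, here is a rough Python sketch applied to the worked example's traffic. The rates are snapshot values from the Medium 3.1 and Small 4 rows of the pricing table; the 60% routing share, the 500-token output cap and the 25% batchable share are hypothetical figures chosen only for illustration, and the batch discount is taken as the roughly 50% stated on this page.

```python
from dataclasses import dataclass


@dataclass
class Rates:
    input_price: float   # USD per million input tokens
    output_price: float  # USD per million output tokens


# ASSUMPTION: snapshot rates copied from the pricing table; they may have changed.
MEDIUM = Rates(0.40, 2.00)  # Mistral Medium 3.1
SMALL = Rates(0.15, 0.60)   # Mistral Small 4


def monthly_cost(rates, turns, input_tokens, output_tokens,
                 batch_share=0.0, batch_discount=0.5):
    """Monthly USD cost; batch_share of the traffic gets batch_discount off."""
    per_turn = (input_tokens * rates.input_price
                + output_tokens * rates.output_price) / 1e6
    return per_turn * turns * (1 - batch_share * batch_discount)


TURNS = 80_000
baseline = monthly_cost(MEDIUM, TURNS, 1_200, 800)

# HYPOTHETICAL lever settings, for illustration only.
SMALL_SHARE = 0.6  # share of turns simple enough for Small
routed = (monthly_cost(SMALL, TURNS * SMALL_SHARE, 1_200, 800)
          + monthly_cost(MEDIUM, TURNS * (1 - SMALL_SHARE), 1_200, 800))
capped = monthly_cost(MEDIUM, TURNS, 1_200, 500)                     # cap output at 500 tokens
batched = monthly_cost(MEDIUM, TURNS, 1_200, 800, batch_share=0.25)  # 25% of turns batched

for label, value in [("baseline", baseline), ("route to Small", routed),
                     ("cap output", capped), ("batch 25%", batched)]:
    print(f"{label:15s} ${value:8.2f}  ({1 - value / baseline:.0%} saved)")
```

The levers are modelled independently here; in practice they combine, and the achievable routing and batching shares depend on your own workload.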

When it's the right pick

Choose Mistral when European jurisdiction or an open-weight migration path matters, when your workload is output-heavy, or when you want a capable mid-tier model without frontier pricing. For the very top of reasoning quality you'll still compare against OpenAI and Anthropic; for the absolute lowest price, against DeepSeek. The calculator makes that comparison one click away.

Frequently asked questions

Is Mistral GDPR-friendly? As a European company, Mistral is a common choice for EU data-governance requirements, and its open-weight models can be self-hosted within your own infrastructure for maximum control. Always confirm specifics against your own compliance needs.

Can I self-host Mistral models? Yes. Several are open-weight, so you can run them on your own GPUs. Whether that beats the API on cost depends on utilisation; see the self-hosted cost breakdown.
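The utilisation point can be made concrete with a back-of-the-envelope Python sketch. The GPU price and throughput below are hypothetical placeholders, not measurements, and the API rate is the snapshot $0.20 per million tokens (input and output alike) listed for Ministral 3 14B in the pricing table. The sketch ignores engineering time, storage, networking and redundancy, all of which push the real break-even higher.

```python
def self_host_cost_per_million(gpu_hourly_usd, tokens_per_second, utilisation):
    """USD per million tokens on a rented GPU that is busy `utilisation` of the time."""
    if not 0 < utilisation <= 1:
        raise ValueError("utilisation must be in (0, 1]")
    tokens_per_hour = tokens_per_second * 3600 * utilisation
    return gpu_hourly_usd / tokens_per_hour * 1e6


def break_even_utilisation(gpu_hourly_usd, tokens_per_second, api_price_per_million):
    """Utilisation at which self-hosting matches the API price (above 1 means never)."""
    full_load_cost = self_host_cost_per_million(gpu_hourly_usd, tokens_per_second, 1.0)
    return full_load_cost / api_price_per_million


# HYPOTHETICAL inputs: substitute your own quotes and benchmark results.
GPU_HOURLY_USD = 2.00      # rental price of one GPU per hour
TOKENS_PER_SECOND = 5_000  # aggregate throughput across concurrent requests
API_PRICE = 0.20           # USD per million tokens (snapshot table rate)

print(f"self-host at 30% utilisation: "
      f"${self_host_cost_per_million(GPU_HOURLY_USD, TOKENS_PER_SECOND, 0.3):.3f} per 1M tokens")
print(f"break-even utilisation: "
      f"{break_even_utilisation(GPU_HOURLY_USD, TOKENS_PER_SECOND, API_PRICE):.0%}")
```

With these placeholder inputs the GPU has to stay busy a little over half the time before it undercuts the API; at lower throughput or a higher GPU price the break-even can exceed 100%, meaning the API stays cheaper.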

How does Mistral Medium compare to GPT-5 or Claude Sonnet? It's a capable mid-tier model, often cheaper, and competitive for general work. For the hardest reasoning the frontier labs may still lead — test on your tasks.

Does Mistral offer batch discounts? Yes, a batch tier gives roughly 50% off for asynchronous bulk processing.

Compare Mistral across the full field in the LLM API cost calculator.

Prices are auto-refreshed from a live source and dated. Confirm current pricing on Mistral's page before committing.

Mistral models & current pricing

Model	Label	Context	API price / 1M tokens (in → out)	Sample cost / request
Mistral Nemo	fast	131K	$0.0200 → $0.0300	$0.000110
Mistral Small 3	fast	33K	$0.0500 → $0.0800	$0.000280
Ministral 3 3B 2512	fast	131K	$0.1000 → $0.1000	$0.000500
Mistral Small 3.2 24B	fast	128K	$0.0750 → $0.2000	$0.000500
Voxtral Small 24B 2507	fast	32K	$0.1000 → $0.3000	$0.000700
Ministral 3 8B 2512	fast	262K	$0.1500 → $0.1500	$0.000750
Ministral 3 14B 2512	fast	262K	$0.2000 → $0.2000	$0.001000
Mistral Small 4	fast	262K	$0.1500 → $0.6000	$0.001200
Saba	fast	33K	$0.2000 → $0.6000	$0.001400
Mistral Small 3.1 24B	fast	128K	$0.3510 → $0.5550	$0.001959
Codestral 2508	fast	256K	$0.3000 → $0.9000	$0.002100
Mistral Large 3 2512	fast	262K	$0.5000 → $1.50	$0.003500
Devstral 2 2512	fast	262K	$0.4000 → $2.00	$0.003600
Mistral Medium 3.1	fast	131K	$0.4000 → $2.00	$0.003600
Mistral Medium 3	fast	131K	$0.4000 → $2.00	$0.003600
Mistral Medium 3.5	balanced	262K	$1.50 → $7.50	$0.0135
Mistral Large 2407	balanced	131K	$2.00 → $6.00	$0.0140
Mixtral 8x22B Instruct	balanced	66K	$2.00 → $6.00	$0.0140
Mistral Large	balanced	128K	$2.00 → $6.00	$0.0140
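The page does not say what request size the sample-cost column assumes. The listed figures are consistent with a request of 4,000 input tokens and 1,000 output tokens, so the Python sketch below uses that split as an inferred assumption and recomputes a few rows. It also prints each model's output-to-input price ratio, which is the number to watch for output-heavy workloads.

```python
# (input price, output price, listed sample cost), copied from rows of the table above.
# Prices are USD per million tokens and are a snapshot of the live feed.
ROWS = {
    "Mistral Nemo": (0.02, 0.03, 0.000110),
    "Mistral Small 3.1 24B": (0.351, 0.555, 0.001959),
    "Mistral Medium 3.1": (0.40, 2.00, 0.003600),
    "Mistral Medium 3.5": (1.50, 7.50, 0.0135),
}

# ASSUMPTION: request size inferred from the listed figures, not stated on the page.
SAMPLE_INPUT_TOKENS = 4_000
SAMPLE_OUTPUT_TOKENS = 1_000


def sample_cost(input_price, output_price):
    """USD cost of one sample request at the assumed token split."""
    return (SAMPLE_INPUT_TOKENS * input_price
            + SAMPLE_OUTPUT_TOKENS * output_price) / 1e6


for model, (inp, out, listed) in ROWS.items():
    computed = sample_cost(inp, out)
    print(f"{model:24s} computed ${computed:.6f}  listed ${listed:.6f}  "
          f"out/in {out / inp:.1f}x")
```

To estimate your own per-request cost, change the two token constants to match your typical prompt and response lengths.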
