Mistral OCR 25.03
Mistral OCR 25.03
Version: 1
Mistral AILast updated August 2025
Document conversion to markdown with interleaved images and text
Vision
Low latency

Key capabilities

About this model

Mistral OCR 25.03 excels in understanding complex document elements, including interleaved imagery, mathematical expressions, tables, and advanced layouts such as LaTeX formatting. The model enables deeper understanding of rich documents such as scientific papers with charts, graphs, equations and figures.

Key model capabilities

Mistral OCR 25.03 has consistently outperformed other leading OCR models in rigorous benchmark tests. Its superior accuracy across multiple aspects of document analysis is illustrated below. We extract embedded images from documents along with text. The other LLMs compared below, do not have that capability. For a fair comparison, we evaluate them on our internal "text-only" test-set containing various publication papers, and PDFs from the web; below:
ModelOverallMathMultilingualScannedTables
Google Document AI83.4280.2986.4292.7778.16
Azure OCR89.5285.7287.5294.6589.52
Gemini-1.5-Flash-00290.2389.1186.7694.8790.48
Gemini-1.5-Pro-00289.9288.4886.3396.1589.71
Gemini-2.0-Flash-00188.6984.1885.8095.1191.46
GPT-4o-2024-11-2089.7787.5586.0094.5891.70
Mistral OCR 25.0394.8994.2989.5598.9696.12
ModelFuzzy Match in Generation
Google-Document-AI95.88
Gemini-2.0-Flash-00196.53
Azure OCR97.31
Mistral OCR 25.0399.02

Use cases

See Responsible AI for additional considerations for responsible use.

Key use cases

Help your organization elevate its knowledge by transforming your extensive document repositories into actions and solutions. Some of the key use cases where Mistral OCR 25.03 is making a significant impact include:
  1. Digitizing scientific research: Leading research institutions have been experimenting with Mistral OCR to convert scientific papers and journals into AI-ready formats, making them accessible to downstream intelligence engines. This has facilitated measurably faster collaboration and accelerated scientific workflows.
  2. Preserving historical and cultural heritage: Organizations and nonprofits that are custodians of heritage have been using Mistral OCR to digitize historical documents and artifacts, ensuring their preservation and making them accessible to a broader audience.
  3. Streamlining customer service: Customer service departments are exploring Mistral OCR to transform documentation and manuals into indexed knowledge, reducing response times and improving customer satisfaction.
  4. Making literature across design, education, legal, etc. AI ready: Mistral OCR has also been helping companies convert technical literature, engineering drawings, lecture notes, presentations, regulatory filings and much more into indexed, answer-ready formats, unlocking intelligence and productivity across millions of documents.

Out of scope use cases

The provider has not supplied this information.

Pricing

Pricing is based on a number of factors, including deployment type and tokens used. See pricing details here.

Technical specs

Being lighter weight than most models in the category, Mistral OCR 25.03 performs significantly faster than its peers, processing thousands of pages per minute. The ability to rapidly process documents ensures continuous learning and improvement even for high-throughput environments.

Training cut-off date

The provider has not supplied this information.

Training time

The provider has not supplied this information.

Input formats

It takes images and PDFs as input and extracts content in an ordered interleaved text and images.

Output formats

The OCR endpoint returns .MD format. Combine it with Mistral Small 3.1 to return JSON format.

Supported languages

Since Mistral's founding, we have aspired to serve the world with our models, and consequently strived for multilingual capabilities across our offerings. Mistral OCR 25.03 takes this to a new level, being able to parse, understand, and transcribe thousands of scripts, fonts, and languages across all continents. This versatility is crucial for both global organizations that handle documents from diverse linguistic backgrounds, as well as hyperlocal businesses serving niche markets.
LanguageAzure OCRGoogle Doc AIGemini-2.0-Flash-001Mistral OCR 2503
ru97.3595.5696.5899.09
fr97.5096.3697.0699.20
hi96.4595.6594.9997.55
zh91.4090.8991.8597.11
pt97.9696.2497.2599.42
de98.3997.0997.1999.51
es98.5497.5297.7599.54
tr95.9193.8594.6697.00
uk97.8196.2496.7099.29
it98.3197.6997.6899.42
ro96.4595.1495.8898.79

Sample JSON response

The provider has not supplied this information.

Model architecture

The provider has not supplied this information.

Long context

The provider has not supplied this information.

Optimizing model performance

To further enhance its capabilities, Mistral OCR 25.03 can be coupled with Mistral Small 3.1 to reformat the results. This combination ensures that the extracted content is not only accurate but also presented in a structured and coherent manner, making it suitable for various downstream applications and analyses. Have a look at this cookbook to combine OCR with another model.

Additional assets

See this cookbook for a detailed tutorial.

Training disclosure

Training, testing and validation

The provider has not supplied this information.

Distribution

Distribution channels

The provider has not supplied this information.

More information

PLAYGROUND WILL SOON BE AVAILABLE FOR OCR

Responsible AI considerations

Safety techniques

The provider has not supplied this information.

Safety evaluations

The provider has not supplied this information.

Known limitations

The provider has not supplied this information.

Acceptable use

Acceptable use policy

The provider has not supplied this information.

Quality and performance evaluations

Source: Mistral AI Top-tier benchmarks Mistral OCR 25.03 has consistently outperformed other leading OCR models in rigorous benchmark tests. Its superior accuracy across multiple aspects of document analysis is illustrated below. We extract embedded images from documents along with text. The other LLMs compared below, do not have that capability. For a fair comparison, we evaluate them on our internal "text-only" test-set containing various publication papers, and PDFs from the web; below:
ModelOverallMathMultilingualScannedTables
Google Document AI83.4280.2986.4292.7778.16
Azure OCR89.5285.7287.5294.6589.52
Gemini-1.5-Flash-00290.2389.1186.7694.8790.48
Gemini-1.5-Pro-00289.9288.4886.3396.1589.71
Gemini-2.0-Flash-00188.6984.1885.8095.1191.46
GPT-4o-2024-11-2089.7787.5586.0094.5891.70
Mistral OCR 25.0394.8994.2989.5598.9696.12
Natively multilingual Since Mistral's founding, we have aspired to serve the world with our models, and consequently strived for multilingual capabilities across our offerings. Mistral OCR 25.03 takes this to a new level, being able to parse, understand, and transcribe thousands of scripts, fonts, and languages across all continents. This versatility is crucial for both global organizations that handle documents from diverse linguistic backgrounds, as well as hyperlocal businesses serving niche markets.
ModelFuzzy Match in Generation
Google-Document-AI95.88
Gemini-2.0-Flash-00196.53
Azure OCR97.31
Mistral OCR 25.0399.02
Benchmarks per language
LanguageAzure OCRGoogle Doc AIGemini-2.0-Flash-001Mistral OCR 2503
ru97.3595.5696.5899.09
fr97.5096.3697.0699.20
hi96.4595.6594.9997.55
zh91.4090.8991.8597.11
pt97.9696.2497.2599.42
de98.3997.0997.1999.51
es98.5497.5297.7599.54
tr95.9193.8594.6697.00
uk97.8196.2496.7099.29
it98.3197.6997.6899.42
ro96.4595.1495.8898.79

Benchmarking methodology

Source: Mistral AI The provider has not supplied this information.

Public data summary

Source: Mistral AI The provider has not supplied this information.
Model Specifications
Context Length128000
LicenseCustom
Last UpdatedAugust 2025
Input TypePdf,Image
Output TypeText
ProviderMistral AI
Languages27 Languages