Mistral OCR 25.03
Mistral OCR 25.03
Version: 1
Mistral AILast updated April 2025
Document conversion to markdown with interleaved images and text
Vision
Low latency
PLAYGROUND WILL SOON BE AVAILABLE FOR OCR The OCR endpoint returns .MD format. Combine it with Mistral Small 3.1 to return JSON format. See this cookbook for a detailed tutorial.
Mistral OCR 25.03 is an Optical Character Recognition API that sets a new standard in document understanding. Unlike other models, Mistral OCR comprehends each element of documents—media, text, tables, equations—with unprecedented accuracy and cognition. It takes images and PDFs as input and extracts content in an ordered interleaved text and images. As a result, Mistral OCR 25.03 is an ideal model to use in combination with a RAG system taking multimodal documents (such as slides or complex PDFs) as input. Mistral OCR 25.03 excels in understanding complex document elements, including interleaved imagery, mathematical expressions, tables, and advanced layouts such as LaTeX formatting. The model enables deeper understanding of rich documents such as scientific papers with charts, graphs, equations and figures. Being lighter weight than most models in the category, Mistral OCR 25.03 performs significantly faster than its peers, processing thousands of pages per minute. The ability to rapidly process documents ensures continuous learning and improvement even for high-throughput environments. To further enhance its capabilities, Mistral OCR 25.03 can be coupled with Mistral Small 3.1 to reformat the results. This combination ensures that the extracted content is not only accurate but also presented in a structured and coherent manner, making it suitable for various downstream applications and analyses. Have a look at this cookbook to combine OCR with another model.

Intended Use

Primary Use Cases

Help your organization elevate its knowledge by transforming your extensive document repositories into actions and solutions. Some of the key use cases where Mistral OCR 25.03 is making a significant impact include:
  1. Digitizing scientific research: Leading research institutions have been experimenting with Mistral OCR to convert scientific papers and journals into AI-ready formats, making them accessible to downstream intelligence engines. This has facilitated measurably faster collaboration and accelerated scientific workflows.
  2. Preserving historical and cultural heritage: Organizations and nonprofits that are custodians of heritage have been using Mistral OCR to digitize historical documents and artifacts, ensuring their preservation and making them accessible to a broader audience.
  3. Streamlining customer service: Customer service departments are exploring Mistral OCR to transform documentation and manuals into indexed knowledge, reducing response times and improving customer satisfaction.
  4. Making literature across design, education, legal, etc. AI ready: Mistral OCR has also been helping companies convert technical literature, engineering drawings, lecture notes, presentations, regulatory filings and much more into indexed, answer-ready formats, unlocking intelligence and productivity across millions of documents.
Top-tier benchmarks Mistral OCR 25.03 has consistently outperformed other leading OCR models in rigorous benchmark tests. Its superior accuracy across multiple aspects of document analysis is illustrated below. We extract embedded images from documents along with text. The other LLMs compared below, do not have that capability. For a fair comparison, we evaluate them on our internal “text-only” test-set containing various publication papers, and PDFs from the web; below:
ModelOverallMathMultilingualScannedTables
Google Document AI83.4280.2986.4292.7778.16
Azure OCR89.5285.7287.5294.6589.52
Gemini-1.5-Flash-00290.2389.1186.7694.8790.48
Gemini-1.5-Pro-00289.9288.4886.3396.1589.71
Gemini-2.0-Flash-00188.6984.1885.8095.1191.46
GPT-4o-2024-11-2089.7787.5586.0094.5891.70
Mistral OCR 25.0394.8994.2989.5598.9696.12
Natively multilingual Since Mistral’s founding, we have aspired to serve the world with our models, and consequently strived for multilingual capabilities across our offerings. Mistral OCR 25.03 takes this to a new level, being able to parse, understand, and transcribe thousands of scripts, fonts, and languages across all continents. This versatility is crucial for both global organizations that handle documents from diverse linguistic backgrounds, as well as hyperlocal businesses serving niche markets.
ModelFuzzy Match in Generation
Google-Document-AI95.88
Gemini-2.0-Flash-00196.53
Azure OCR97.31
Mistral OCR 25.0399.02
Benchmarks per language
LanguageAzure OCRGoogle Doc AIGemini-2.0-Flash-001Mistral OCR 2503
ru97.3595.5696.5899.09
fr97.5096.3697.0699.20
hi96.4595.6594.9997.55
zh91.4090.8991.8597.11
pt97.9696.2497.2599.42
de98.3997.0997.1999.51
es98.5497.5297.7599.54
tr95.9193.8594.6697.00
uk97.8196.2496.7099.29
it98.3197.6997.6899.42
ro96.4595.1495.8898.79
Model Specifications
Context Length128000
LicenseCustom
Last UpdatedApril 2025
Input TypePdf,Image
Output TypeText
PublisherMistral AI
Languages27 Languages