Ministral-3B
Ministral 3B is a state-of-the-art Small Language Model (SLM) optimized for edge computing and on-device applications. As it is designed for low-latency and compute-efficient inference, it it also the perfect model for standard GenAI applications that have
Models from Microsoft, Partners, and Community models are a select portfolio of curated models both general-purpose and niche models across diverse scenarios by developed by Microsoft teams, partners, and community contributors
- Managed by Microsoft: Purchase and manage models directly through Azure with a single license, world class support and enterprise grade Azure infrastructure
- Validated by providers: Each model is validated and maintained by its respective provider, with Azure offering integration and deployment guidance.
- Innovation and agility: Combines Microsoft research models with rapid, community-driven advancements.
- Seamless Azure integration: Standard Microsoft Foundry experience, with support managed by the model provider.
- Flexible deployment: Deployable as Managed Compute or Serverless API, based on provider preference.
About this model
Ministral 3B and Ministral 8B set a new frontier in knowledge, commonsense, reasoning, function-calling, and efficiency in the sub-10B category, and can be used or tuned to a variety of uses, from orchestrating agentic workflows to creating specialist task workers.Key model capabilities
Knowledge & Commonsense
| Model | MMLU | AGIEval | Winogrande | Arc-c | TriviaQA |
|---|---|---|---|---|---|
| Gemma 2 2B | 52.4 | 33.8 | 68.7 | 42.6 | 47.8 |
| Llama 3.2 3B | 56.2 | 37.4 | 59.6 | 43.1 | 50.7 |
| Ministral 3B | 60.9 | 42.1 | 72.7 | 64.2 | 56.7 |
| Mistral 7B | 62.4 | 42.5 | 74.2 | 67.9 | 62.5 |
| Llama 3.1 8B | 64.7 | 44.4 | 74.6 | 46.0 | 60.2 |
| Ministral 8B | 65.0 | 48.3 | 75.3 | 71.9 | 65.5 |
Code and Math
| Model | HumanEval (pass@1) | GSM8K (maj@8) |
|---|---|---|
| Gemma 2 2B | 20.1 | 35.5 |
| Llama 3.2 3B | 29.9 | 37.2 |
| Ministral 3B | 34.2 | 50.9 |
| Mistral 7B | 26.8 | 51.3 |
| Llama 3.1 8B | 37.8 | 61.7 |
| Ministral 8B | 34.8 | 64.5 |
Chat/Arena (gpt-4o judge)
| Model | MTBench | Arena Hard | Wild bench |
|---|---|---|---|
| Gemma 2 2B | 7.5 | 51.7 | 32.5 |
| Llama 3.2 3B | 7.2 | 46.0 | 27.2 |
| Ministral 3B | 8.1 | 64.3 | 36.3 |
| Mistral 7B | 6.7 | 44.3 | 33.1 |
| Llama 3.1 8B | 7.5 | 62.4 | 37.0 |
| Gemma 2 9B | 7.6 | 68.7 | 43.8 |
| Ministral 8B | 8.3 | 70.9 | 41.3 |
Quick facts
Model providerMistral AI
TypeChat completion
LifecycleGenerally available (GA)
Input typetext
Output typetext
Context window131.072k
Token limits4096 output
PricingView pricing