Ministral-3B

Ministral 3B is a state-of-the-art Small Language Model (SLM) optimized for edge computing and on-device applications. As it is designed for low-latency and compute-efficient inference, it is also the perfect model for standard GenAI applications that have
Mistral AI
Version: 1
Models from Microsoft, Partners, and Community are a select portfolio of curated models, both general-purpose and niche, covering diverse scenarios and developed by Microsoft teams, partners, and community contributors.
  • Managed by Microsoft: Purchase and manage models directly through Azure with a single license, world-class support, and enterprise-grade Azure infrastructure.
  • Validated by providers: Each model is validated and maintained by its respective provider, with Azure offering integration and deployment guidance.
  • Innovation and agility: Combines Microsoft research models with rapid, community-driven advancements.
  • Seamless Azure integration: Standard Microsoft Foundry experience, with support managed by the model provider.
  • Flexible deployment: Deployable as Managed Compute or Serverless API, based on provider preference.

About this model

Ministral 3B and Ministral 8B set a new frontier in knowledge, commonsense, reasoning, function-calling, and efficiency in the sub-10B category. They can be used as-is or tuned for a variety of uses, from orchestrating agentic workflows to creating specialist task workers.

Key model capabilities

Knowledge & Commonsense

Model          MMLU   AGIEval   Winogrande   Arc-c   TriviaQA
Gemma 2 2B     52.4   33.8      68.7         42.6    47.8
Llama 3.2 3B   56.2   37.4      59.6         43.1    50.7
Ministral 3B   60.9   42.1      72.7         64.2    56.7
Mistral 7B     62.4   42.5      74.2         67.9    62.5
Llama 3.1 8B   64.7   44.4      74.6         46.0    60.2
Ministral 8B   65.0   48.3      75.3         71.9    65.5

Code and Math

Model          HumanEval (pass@1)   GSM8K (maj@8)
Gemma 2 2B     20.1                 35.5
Llama 3.2 3B   29.9                 37.2
Ministral 3B   34.2                 50.9
Mistral 7B     26.8                 51.3
Llama 3.1 8B   37.8                 61.7
Ministral 8B   34.8                 64.5
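
The maj@8 label on the GSM8K column denotes majority voting: eight answers are sampled per problem, and the most frequent one is scored against the reference. The Python sketch below illustrates that scoring rule, assuming final answers have already been extracted as strings. The function names and the sample answers are hypothetical and are not taken from any benchmark harness or from the evaluation behind the table.

```python
from collections import Counter
from typing import Sequence


def majority_answer(samples: Sequence[str]) -> str:
    """Return the most frequent answer; ties go to the answer seen first."""
    if not samples:
        raise ValueError("need at least one sampled answer")
    counts = Counter(s.strip() for s in samples)
    # most_common keeps first-seen order among answers with equal counts.
    return counts.most_common(1)[0][0]


def maj_at_k_accuracy(
    sampled: Sequence[Sequence[str]], references: Sequence[str]
) -> float:
    """Percentage of problems whose majority-voted answer equals the reference."""
    if not references or len(sampled) != len(references):
        raise ValueError("need one non-empty list of samples per reference")
    correct = sum(
        majority_answer(answers) == ref.strip()
        for answers, ref in zip(sampled, references)
    )
    return 100.0 * correct / len(references)


# Hypothetical data: two problems, eight sampled answers each (k = 8).
sampled_answers = [
    ["18"] * 5 + ["20"] * 3,  # majority "18" matches the reference
    ["7"] * 3 + ["9"] * 5,    # majority "9" does not match the reference
]
reference_answers = ["18", "7"]
score = maj_at_k_accuracy(sampled_answers, reference_answers)
```

By contrast, pass@1 on HumanEval scores a single generated solution per problem, so no voting step is involved.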

Chat/Arena (gpt-4o judge)

Model          MTBench   Arena Hard   Wild bench
Gemma 2 2B     7.5       51.7         32.5
Llama 3.2 3B   7.2       46.0         27.2
Ministral 3B   8.1       64.3         36.3
Mistral 7B     6.7       44.3         33.1
Llama 3.1 8B   7.5       62.4         37.0
Gemma 2 9B     7.6       68.7         43.8
Ministral 8B   8.3       70.9         41.3

Quick facts

Model provider: Mistral AI
Type: Chat completion
Lifecycle: Generally available (GA)
Input type: text
Output type: text
Context window: 131,072 tokens
Token limits: 4096 output
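
The context window and output limit above suggest a simple budget check before sending a request: the completion cannot exceed 4096 tokens, and the prompt plus the completion should fit within the 131,072-token window. The Python sketch below assumes the prompt's token count is already known from a tokenizer and that input and output share the context window. The helper `plan_max_tokens` is hypothetical and is not part of any Azure or Mistral SDK.

```python
CONTEXT_WINDOW = 131_072   # tokens, from the quick facts
MAX_OUTPUT_TOKENS = 4_096  # output token limit, from the quick facts


def plan_max_tokens(prompt_tokens: int, requested: int = MAX_OUTPUT_TOKENS) -> int:
    """Return the largest completion budget that respects both limits.

    Assumption: prompt and completion tokens count against the same window.
    """
    if prompt_tokens < 0 or requested <= 0:
        raise ValueError("prompt_tokens must be >= 0 and requested must be > 0")
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt already fills the context window")
    # Cap by the caller's request, the output limit, and the space left.
    return min(requested, MAX_OUTPUT_TOKENS, remaining)


short_prompt_budget = plan_max_tokens(1_000)    # capped by the output limit
long_prompt_budget = plan_max_tokens(130_000)   # capped by the space left
```

The returned value could then be passed as the maximum-output-tokens setting of a chat completion request, whatever that parameter is called in the client being used.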