OpenAI o3-mini
OpenAI o3-mini
Version: 2025-01-31
OpenAILast updated December 2025
o3-mini includes the o1 features with significant cost-efficiencies for scenarios requiring high performance.
Reasoning
Multilingual
Coding

Direct from Azure models

Direct from Azure models are a select portfolio curated for their market-differentiated capabilities:
  • Secure and managed by Microsoft: Purchase and manage models directly through Azure with a single license, consistent support, and no third-party dependencies, backed by Azure's enterprise-grade infrastructure.
  • Streamlined operations: Benefit from unified billing, governance, and seamless PTU portability across models hosted on Azure - all part of Microsoft Foundry.
  • Future-ready flexibility: Access the latest models as they become available, and easily test, deploy, or switch between them within Microsoft Foundry; reducing integration effort.
  • Cost control and optimization: Scale on demand with pay-as-you-go flexibility or reserve PTUs for predictable performance and savings.
Learn more about Direct from Azure models .

Key capabilities

About this model

This model is provided through the Azure OpenAI Service.

Responsible AI considerations

Safety techniques

OpenAI has incorporated additional safety measures into the o1 models, including new techniques to help the models refuse unsafe requests. These advancements make the o1 series some of the most robust models available.

Safety evaluations

OpenAI measures safety is by testing how well models continue to follow its safety rules if a user tries to bypass them (known as "jailbreaking"). In OpenAI's internal tests, GPT-4o scored 22 (on a scale of 0-100) while o1-preview model scored 84. You can read more about this in the OpenAI's system card and research post .

Known limitations

o1 model does not include all the features available in other models.

Acceptable use

Acceptable use policy

The provider has not supplied this information.

Quality and performance evaluations

Source: OpenAI In OpenAI's internal tests, GPT-4o scored 22 (on a scale of 0-100) while o1-preview model scored 84.

Benchmarking methodology

Source: OpenAI OpenAI measures safety is by testing how well models continue to follow its safety rules if a user tries to bypass them (known as "jailbreaking").

Public data summary

Source: OpenAI The provider has not supplied this information.
Model Specifications
Context Length200000
Quality Index0.87
LicenseCustom
Training DataSeptember 2023
Last UpdatedDecember 2025
Input TypeText
Output TypeText
ProviderOpenAI
Languages27 Languages