Overview
OpenAI aims to create "safe and beneficial" artificial general intelligence, defined as highly autonomous systems that outperform humans at most economically valuable work. The organization leads today's AI with some of the world's most advanced multimodal foundation models, including gpt‑5.1, and continues to compress state‑of‑the‑art performance into smaller, less expensive variants like gpt-5.1-mini. The latest gpt-4o-transcribe-diarize model performs speech tasks, while gpt-image-1 incorporates native image generation and editing.Key OpenAI Models (November 2025)
- gpt‑5.1 – Best‑in‑class reasoning, code, audio & vision in one API.
- gpt‑5-mini – Small‑footprint model at a lower cost.
- gpt-4o-transcribe-diarize – Upgraded speech models with lower error rates and customizable voices.
- gpt‑image‑1 – Next‑gen text‑to‑image + in‑painting that supersedes DALL·E 3.
- gpt-5.1-codex – Designed for steerability, front end development, and interactivity.
- Sora-2 – Creative powerhouse seamlessly integrated into a platform built for innovation, trust, and scale.
Why OpenAI on Azure
All OpenAI endpoints inherit enterprise‑grade Content Safety, flexible serverless or provisioned‑throughput deployments, Azure billing, and private‑network inference so you can move from prototype to production in days, not months.gpt-5.2-chat (preview) is an advanced, natural, multimodal, and context-aware conversations for enterprise applications.
GPT-5.2 is engineered for enterprise agent scenarios—delivering structured, auditable outputs, reliable tool use, and governed integrations.
gpt-5.1-codex-max is agentic coding model designed to streamline complex development workflows with advanced efficiency
gpt-5.1 is designed for logic-heavy and multi-step tasks.
gpt-5.1-codex is designed for steerability, front end development, and interactivity.
gpt-5-chat (preview) is an advanced, natural, multimodal, and context-aware conversations for enterprise applications.
Sora 2 in Azure AI Foundry isn't just another video generation tool; it's a creative powerhouse, seamlessly integrated into a platform built for innovation, trust, and scale.
gpt-5.1-chat (preview) is an advanced, natural, multimodal, and context-aware conversations for enterprise applications.
gpt-5.1-codex-mini is designed for steerability, front end development, and interactivity.
gpt-5-pro uses more compute to think harder and provide consistently better answers.
gpt-5 is designed for logic-heavy and multi-step tasks.
gpt-4.1 outperforms gpt-4o across the board, with major gains in coding, instruction following, and long-context understanding
gpt-4.1-mini outperform gpt-4o-mini across the board, with major gains in coding, instruction following, and long-context handling
A cutting-edge speech-to-text solution that deliverables reliable and accurate transcripts; now equipped with diarization support aka identifying different speakers through the transcription.
gpt-5-codex is designed for steerability, front end development, and interactivity.
o3 includes significant improvements on quality and safety while supporting the existing features of o1 and delivering comparable or better performance.
gpt-realtime-mini is a smaller version of gpt-realtime S2S (speech to speech) model built on chive architecture. This model excels at instruction following and is optimized for cost efficiency.
gpt-5-nano is optimized for speed, ideal for applications requiring low latency.
gpt-5-mini is a lightweight version for cost-sensitive applications.
o4-mini includes significant improvements on quality and safety while supporting the existing features of o3-mini and delivering comparable or better performance.
gpt-4.1-nano provides gains in coding, instruction following, and long-context handling along with lower latency and cost
o3-mini includes the o1 features with significant cost-efficiencies for scenarios requiring high performance.
Best suited for rich, asynchronous audio input/output interactions, such as creating spoken summaries from text.
Push the open model frontier with GPT-OSS models, released under the permissive Apache 2.0 license, allowing anyone to use, modify, and deploy them freely.
A new S2S (speech to speech) model with improved instruction following.
Best suited for rich, asynchronous audio input/output interactions, such as creating spoken summaries from text.
The o3 series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o1-pro model uses more compute to think harder and provide consistently better answers.
codex-mini is a fine-tuned variant of the o4-mini model, designed to deliver rapid, instruction-following performance for developers working in CLI workflows. Whether you're automating shell commands, editing scripts, or refactoring repositories, Codex-Min
the largest and strongest general purpose model in the gpt model family up to date, best suited for diverse text and image tasks.
Push the open safety model frontier with gpt-oss-safeguard models, released under the permissive Apache 2.0 license, allowing anyone to use, modify, and deploy them freely.
Push the open safety model frontier with gpt-oss-safeguard models, released under the permissive Apache 2.0 license, allowing anyone to use, modify, and deploy them freely.
Push the open model frontier with GPT-OSS models, released under the permissive Apache 2.0 license, allowing anyone to use, modify, and deploy them freely.
The o3 series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o1-pro model uses more compute to think harder and provide consistently better answers.
An efficient AI solution to generate videos
An efficient AI solution for diverse text and image tasks, including text to image, image to image, inpainting, and prompt transformation.
An advanced text-to-speech solution designed to convert written text into natural-sounding speech.
A cutting-edge speech-to-text solution that deliverables reliable and accurate transcripts.
A highly efficient and cost effective speech-to-text solution that deliverables reliable and accurate transcripts.
computer-use-preview is the model for Computer Use Agent for use in Responses API. You can use computer-use-preview model to get instructions to control a browser on your computer screen and take action on a user's behalf.
Best suited for rich, asynchronous audio input/output interactions, such as creating spoken summaries from text.
Best suited for rich, asynchronous audio input/output interactions, such as creating spoken summaries from text.
Focused on advanced reasoning and solving complex problems, including math and science tasks. Ideal for applications that require deep contextual understanding and agentic workflows.
Smaller, faster, and 80% cheaper than o1-preview, performs well at code generation and small context operations.
OpenAI's most advanced multimodal model in the gpt-4o family. Can handle both text and image inputs.
An affordable, efficient AI solution for diverse text and image tasks.
Best suited for rich, asynchronous audio input/output interactions, such as creating spoken summaries from text.
Direct from Azure models Direct from Azure models are a select portfolio curated for their marketdifferentiated capabilities: Secure and managed by Microsoft: Purchase and manage models directly through Azure with a single license, consistent support, and no thirdparty dependencies, backed b
Focused on advanced reasoning and solving complex problems, including math and science tasks. Ideal for applications that require deep contextual understanding and agentic workflows.
Direct from Azure models Direct from Azure models are a select portfolio curated for their marketdifferentiated capabilities: Secure and managed by Microsoft: Purchase and manage models directly through Azure with a single license, consistent support, and no thirdparty dependencies, backed b
Direct from Azure models Direct from Azure models are a select portfolio curated for their marketdifferentiated capabilities: Secure and managed by Microsoft: Purchase and manage models directly through Azure with a single license, consistent support, and no thirdparty dependencies, backed b
Azure Direct Models Direct from Azure models are a select portfolio curated for their marketdifferentiated capabilities: Secure and managed by Microsoft: Purchase and manage models directly through Azure with a single license, consistent support, and no thirdparty dependencies, backed by Az