Azure OpenAI
Microsoft-hosted OpenAI models, including GPT-4 and Codex, offering enterprise-grade security and compliance.
Total Models: 36
sora

An efficient AI solution for generating videos.

video-generation
o3

o3 includes significant improvements in quality and safety while supporting the existing features of o1 and delivering comparable or better performance.

chat-completion
o4-mini

o4-mini includes significant improvements in quality and safety while supporting the existing features of o3-mini and delivering comparable or better performance.

chat-completion
gpt-image-1

An efficient AI solution for diverse text and image tasks, including text to image, image to image, inpainting, and prompt transformation.

text-to-image
gpt-4.1

gpt-4.1 outperforms gpt-4o across the board, with major gains in coding, instruction following, and long-context understanding.

chat-completion
gpt-4.1-mini

gpt-4.1-mini outperforms gpt-4o-mini across the board, with major gains in coding, instruction following, and long-context handling.

chat-completion
gpt-4.1-nano

gpt-4.1-nano provides gains in coding, instruction following, and long-context handling, along with lower latency and cost.

chat-completion
gpt-4.5-preview

The largest and strongest general-purpose model in the GPT model family to date, best suited for diverse text and image tasks.

chat-completion
o3-mini

o3-mini includes the o1 features and offers significant cost efficiencies for scenarios requiring high performance.

chat-completion
gpt-4o-mini-tts

An advanced text-to-speech solution designed to convert written text into natural-sounding speech.

text-to-speech
gpt-4o-transcribe

A cutting-edge speech-to-text solution that delivers reliable and accurate transcripts.

speech-to-text
gpt-4o-mini-transcribe

A highly efficient and cost-effective speech-to-text solution that delivers reliable and accurate transcripts.

speech-to-text
computer-use-preview

computer-use-preview is the model behind the Computer Use Agent, for use with the Responses API. You can use the computer-use-preview model to get instructions for controlling a browser on your computer screen and taking action on a user's behalf.

responses
gpt-4o-mini-audio-preview

Best suited for rich, asynchronous audio input/output interactions, such as creating spoken summaries from text.

audio-generation
gpt-4o-mini-realtime-preview

Best suited for rich, asynchronous audio input/output interactions, such as creating spoken summaries from text.

audio-generation
o1

Focused on advanced reasoning and solving complex problems, including math and science tasks. Ideal for applications that require deep contextual understanding and agentic workflows.

chat-completion
o1-mini

Smaller, faster, and 80% cheaper than o1-preview; performs well at code generation and small-context operations.

chat-completion
gpt-4o

OpenAI's most advanced multimodal model in the gpt-4o family. Can handle both text and image inputs.

chat-completion
gpt-4o-mini

An affordable, efficient AI solution for diverse text and image tasks.

chat-completion
gpt-4o-audio-preview

Best suited for rich, asynchronous audio input/output interactions, such as creating spoken summaries from text.

audio-generation
gpt-4o-realtime-preview

The gpt-4o-realtime-preview model introduces a new era in AI interaction by incorporating the new audio modality powered by gpt-4o. This new modality allows for seamless speech-to-speech and text-to-speech applications, providing a richer and more engaging user experience. Engineered for speed and e

audio-generation
o1-preview

Focused on advanced reasoning and solving complex problems, including math and science tasks. Ideal for applications that require deep contextual understanding and agentic workflows.

chat-completion
gpt-4

gpt-4 is a large multimodal model that accepts text or image inputs and outputs text. It can solve complex problems with greater accuracy than any of our previous models, thanks to its extensive general knowledge and advanced reasoning capabilities. gpt-4 provides a wide range of model versions to

chat-completion
dall-e-3

DALL-E 3 generates images from text prompts that are provided by the user. DALL-E 3 is generally available for use on Azure OpenAI. The image generation API creates an image from a text prompt. It does not edit existing images or create variations. Learn more at: <https://learn.microsoft.com/azur

text-to-image
davinci-002

davinci-002 is the latest version of the Davinci GPT-3 base model. davinci-002 replaces the deprecated Curie and Davinci models. It is a smaller, faster model that is primarily used for fine-tuning tasks. This model supports a maximum of 16,384 input tokens, and its training data extends to Sep 2021. davinci-002 su

completions
gpt-35-turbo-16k

gpt-3.5 models can understand and generate natural language or code. The most capable and cost-effective model in the gpt-3.5 family is gpt-3.5-turbo, which has been optimized for chat and works well for traditional completions tasks as well. gpt-3.5-turbo is available for use with the Chat Completi

chat-completion
gpt-35-turbo-instruct

gpt-3.5 models can understand and generate natural language or code. The most capable and cost-effective model in the gpt-3.5 family is gpt-3.5-turbo, which has been optimized for chat and works well for traditional completions tasks as well. gpt-3.5-turbo is available for use with the Chat Completi

chat-completion
gpt-35-turbo

gpt-35-turbo (also known as ChatGPT) is the most capable and cost-effective model in the gpt-3.5 family and has been optimized for chat using the Chat Completions API. It is a language model designed for conversational interfaces, and it behaves differently from previous GPT-3 models. Pr

chat-completion
babbage-002

babbage-002 is the latest version of the Babbage GPT-3 base model. babbage-002 replaces the deprecated Ada and Babbage models. It is a smaller, faster model that is primarily used for fine-tuning tasks. This model supports a maximum of 16,384 input tokens, and its training data extends to Sep 2021. babbage-002 suppo

completions
tts

TTS is a model that converts text to natural-sounding speech. TTS is optimized for real-time or interactive scenarios. For offline scenarios, TTS-HD provides higher quality. The API supports six different voices. Maximum request size: up to 4,096 characters can be converted to speech per API request.

text-to-speech
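
Because the tts description quotes a limit of 4,096 characters per request, longer text has to be split before it is sent. The following is a minimal sketch of one way to do that, not part of any Azure SDK: the helper name `split_for_tts` and the choice to break on spaces are assumptions made for illustration.

```python
MAX_TTS_CHARS = 4096  # per-request limit quoted in the tts description


def split_for_tts(text: str, limit: int = MAX_TTS_CHARS) -> list[str]:
    """Split text into chunks of at most `limit` characters.

    Hypothetical helper: prefers to break at a space and falls back
    to a hard cut when a single run of characters exceeds the limit.
    """
    if limit <= 0:
        raise ValueError("limit must be positive")
    chunks = []
    rest = text.strip()
    while len(rest) > limit:
        # Last space at or before the limit; -1 means none was found.
        cut = rest.rfind(" ", 0, limit + 1)
        if cut <= 0:
            cut = limit  # no usable space: hard cut
        chunks.append(rest[:cut].rstrip())
        rest = rest[cut:].lstrip()
    if rest:
        chunks.append(rest)
    return chunks
```

Each returned chunk can then be submitted as its own request, and the resulting audio segments concatenated in order.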
text-embedding-ada-002

text-embedding-ada-002 outperforms all the earlier embedding models on text search, code search, and sentence similarity tasks and gets comparable performance on text classification. Embeddings are numerical representations of concepts converted to number sequences, which make it easy for computers

embeddings
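
To illustrate how number sequences support the sentence similarity tasks mentioned for this model, here is a minimal sketch of cosine similarity, one common way to compare two embedding vectors. The function name and the tiny example vectors are assumptions for illustration; real embeddings from the service have far more dimensions.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Toy vectors standing in for embeddings (hypothetical values).
same_direction = cosine_similarity([1.0, 2.0], [2.0, 4.0])
orthogonal = cosine_similarity([1.0, 0.0], [0.0, 1.0])
```

Values close to 1 indicate vectors pointing in nearly the same direction; values near 0 indicate unrelated directions.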
text-embedding-3-large

The text-embedding-3 series models are the latest and most capable embedding models from OpenAI.

embeddings
gpt-4-32k

gpt-4 can solve difficult problems with greater accuracy than any of the previous OpenAI models. Like gpt-35-turbo, gpt-4 is optimized for chat but works well for traditional completions tasks. gpt-4 supports a maximum of 8,192 input tokens, and gpt-4-32k supports up to 32,768 tokens. Note: this model

chat-completion
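
The two token limits quoted in the gpt-4-32k description suggest a simple routing rule: use the smaller model when the input fits, otherwise the larger one. The sketch below assumes the token count has already been computed elsewhere; the `pick_model` helper and the `CONTEXT_LIMITS` table are hypothetical names, not part of any Azure API.

```python
# Limits as quoted in the gpt-4-32k description.
CONTEXT_LIMITS = {"gpt-4": 8192, "gpt-4-32k": 32768}


def pick_model(input_tokens: int) -> str:
    """Return the smallest listed model whose limit fits the input."""
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    # Try models from the smallest limit to the largest.
    for name, limit in sorted(CONTEXT_LIMITS.items(), key=lambda kv: kv[1]):
        if input_tokens <= limit:
            return name
    raise ValueError(f"{input_tokens} tokens exceeds every listed limit")
```

For example, an input of 8,192 tokens stays on gpt-4, while 8,193 tokens is routed to gpt-4-32k.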
tts-hd

TTS-HD is a model that converts text to natural-sounding speech. TTS is optimized for real-time or interactive scenarios. For offline scenarios, TTS-HD provides higher quality. The API supports six different voices. Maximum request size: up to 4,096 characters can be converted to speech per API requ

text-to-speech
whisper

The Whisper models are trained for speech recognition and translation tasks. They can transcribe speech audio into text in the language in which it is spoken (automatic speech recognition) and translate it into English (speech translation). Researchers at OpenAI developed the models to study th

automatic-speech-recognition
text-embedding-3-small

The text-embedding-3 series models are the latest and most capable embedding models from OpenAI.

embeddings