OpenAI gpt-audio-mini
OpenAI gpt-audio-mini
Version: 2025-10-06
OpenAILast updated October 2025
Best suited for rich, asynchronous audio input/output interactions, such as creating spoken summaries from text.
gpt-audio-mini enables voice-based interaction by processing spoken prompts and generating responses, capturing subtle audio cues for deeper, more immersive experiences. Note: For customers interested in lower latency audio responses, gpt-realtime-mini may still be more suitable. These audio features can be utilized in various ways:
  • Create spoken summaries from text, offering a more engaging method to present information.
  • Analyze the sentiment of audio recordings, converting vocal nuances into text-based insights.
  • Facilitate asynchronous speech-in, speech-out interactions

Model provider

This model is provided through the Azure OpenAI Service.

Relevant documents

The following documents are applicable:
Model Specifications
Context Length128000
LicenseCustom
Training DataOctober 2025
Last UpdatedOctober 2025
Input TypeAudio,Text
Output TypeAudio,Text
PublisherOpenAI
Languages27 Languages