gpt-4o-mini-tts
An advanced text-to-speech solution designed to convert written text into natural-sounding speech.
Direct from Azure models are a select portfolio curated for their market-differentiated capabilities:
- Secure and managed by Microsoft: Purchase and manage models directly through Azure with a single license, consistent support, and no third-party dependencies, backed by Azure's enterprise-grade infrastructure.
- Streamlined operations: Benefit from unified billing, governance, and seamless PTU portability across models hosted on Azure - all part of Microsoft Foundry.
- Future-ready flexibility: Access the latest models as they become available, and easily test, deploy, or switch between them within Microsoft Foundry; reducing integration effort.
- Cost control and optimization: Scale on demand with pay-as-you-go flexibility or reserve PTUs for predictable performance and savings.
About this model
The gpt-4o-mini-tts model is an advanced text-to-speech solution designed to convert written text into natural-sounding speech. Leveraging the capabilities of GPT-4o, this model offers customizable voice output, allowing developers to instruct the model to speak in specific ways, such as "talk like a sympathetic customer service agent."Key model capabilities
- Customizable voice output with the ability to instruct the model to speak in specific ways
- Natural-sounding speech generation ideal for audiobooks, podcasts, and interactive voice agents
- Expressive and dynamic voice generation capabilities
- Processing of substantial text inputs with support for up to 2,000 tokens
Quick facts
Model providerAzure OpenAI
TypeText to speech
LifecycleGenerally available (GA)
Input typetext, audio
Output typetext, audio
Context window2000
Token limits2000 output
PricingView pricing