gpt-4o-transcribe

A cutting-edge speech-to-text solution that deliverables reliable and accurate transcripts.

Azure OpenAI

Direct from Azure

Version: 2025-03-20

Direct from Azure models

Direct from Azure models are a select portfolio curated for their market-differentiated capabilities:

Secure and managed by Microsoft: Purchase and manage models directly through Azure with a single license, consistent support, and no third-party dependencies, backed by Azure's enterprise-grade infrastructure.
Streamlined operations: Benefit from unified billing, governance, and seamless PTU portability across models hosted on Azure - all part of Microsoft Foundry.
Future-ready flexibility: Access the latest models as they become available, and easily test, deploy, or switch between them within Microsoft Foundry; reducing integration effort.
Cost control and optimization: Scale on demand with pay-as-you-go flexibility or reserve PTUs for predictable performance and savings.

Learn more about Direct from Azure models .

Key capabilities

About this model

The gpt-4o-transcribe model is a cutting-edge speech-to-text solution that leverages the advanced capabilities of GPT-4o to deliver highly accurate audio transcriptions.

Key model capabilities

This model offers significant improvements in word error rate and language recognition, surpassing the performance of previous models like Whisper. Designed for precision and efficiency, gpt-4o-transcribe aims to provide users with reliable and accurate transcripts. With a maximum output of 2,000 tokens, gpt-4o-transcribe can generate detailed and comprehensive transcriptions.

Use cases

Pricing

Technical specs

Training disclosure

Distribution

More information

Quick facts

Model providerAzure OpenAI

TypeSpeech to text

LifecycleGenerally available (GA)

Input typetext, audio

Output typetext

Context window16000

Token limits2000 output

PricingView pricing

gpt-4o-transcribe

About this model

Key model capabilities

Quick facts

Quick Start