gpt-5-nano

gpt-5-nano

gpt-5-nano is optimized for speed, ideal for applications requiring low latency.
Azure OpenAI
Direct from Azure
Version: 2025-08-07
Direct from Azure models are a select portfolio curated for their market-differentiated capabilities:
  • Secure and managed by Microsoft: Purchase and manage models directly through Azure with a single license, consistent support, and no third-party dependencies, backed by Azure's enterprise-grade infrastructure.
  • Streamlined operations: Benefit from unified billing, governance, and seamless PTU portability across models hosted on Azure - all part of Microsoft Foundry.
  • Future-ready flexibility: Access the latest models as they become available, and easily test, deploy, or switch between them within Microsoft Foundry; reducing integration effort.
  • Cost control and optimization: Scale on demand with pay-as-you-go flexibility or reserve PTUs for predictable performance and savings.
Learn more about Direct from Azure models .

About this model

gpt-5-nano is a new class of reasoning model focuses on speed and efficiency with reasoning power for rich question and responses.

Key model capabilities

  • Supports ultra-fast, low-latency use cases
  • Now supporting minimal reasoning, a new verbosity setting, and the "customs" tool for raw text output.
  • New "allowed tools" tool choice that enables you to specify multiple tools in the tool choice instead of just one
  • New "preamble" support, allowing the model to "think" before calling a tool. This is always enabled and controlled through prompting.
  • gpt-5-nano supports multimodal inputs, real-time streaming and full tool support for smarter, more dynamic user experiences

Quick facts

Model providerAzure OpenAI
TypeChat completion, Responses
LifecycleGenerally available (GA)
Input typetext, image
Output typetext
Context window400k
Token limits128k output

Related Models