gpt-oss-20b

gpt-oss-20b

Push the open model frontier with GPT-OSS models, released under the permissive Apache 2.0 license, allowing anyone to use, modify, and deploy them freely.
Azure OpenAI
Version: 11

Key capabilities

About this model

OpenAI's open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.

Key model capabilities

  • Permissive Apache 2.0 license: Build freely without copyleft restrictions or patent risk—ideal for experimentation, customization, and commercial deployment.
  • Configurable reasoning effort: Easily adjust the reasoning effort (low, medium, high) based on your specific use case and latency needs.
  • Full chain-of-thought: Gain complete access to the model's reasoning process, facilitating easier debugging and increased trust in outputs. It's not intended to be shown to end users.
  • Fine-tunable: Fully customize models to your specific use case through parameter fine-tuning.
  • Agentic capabilities: Use the models' native capabilities for function calling, web browsing , Python code execution , and Structured Outputs.
  • Native MXFP4 quantization: The models are trained with native MXFP4 precision for the MoE layer, making gpt-oss-120b run on a single H100 GPU and the gpt-oss-20b model run within 16GB of memory.
See Responsible AI for additional considerations for responsible use.

Key use cases

The provider has not supplied this information.

Out of scope use cases

The provider has not supplied this information.
Pricing is based on a number of factors, including deployment type and tokens used. See pricing details here.

Quick facts

Model providerAzure OpenAI
TypeChat completion
LifecycleGenerally available (GA)
Input typetext
Output typetext
Context window131.072k
Token limits131.072k output