gpt-35-turbo-16k
gpt-35-turbo-16k
Version: 0613
OpenAILast updated September 2024
gpt-3.5 models can understand and generate natural language or code. The most capable and cost effective model in the gpt-3.5 family is gpt-3.5-turbo, which has been optimized for chat and works well for traditional completions tasks as well. gpt-3.5-turbo is available for use with the Chat Completions API. gpt-3.5-turbo Instruct has similar capabilities to text-davinci-003 using the Completions API instead of the Chat Completions API. We recommend using gpt-3.5-turbo and gpt-3.5-turbo-instruct over legacy gpt-3.5 and gpt-3 models.
  • gpt-35-turbo
  • gpt-35-turbo-16k
  • gpt-35-turbo-instruct
You can see the token context length supported by each model in the model summary table. To learn more about how to interact with gpt-3.5-turbo and the Chat Completions API check out our in-depth how-to.
Model IDModel AvailabilityMax Request (tokens)Training Data (up to)
gpt-35-turbo1 (0301)East US, France Central, South Central US, UK South, West Europe4,096Sep 2021
gpt-35-turbo (0613)Australia East, Canada East, East US, East US 2, France Central, Japan East, North Central US, Sweden Central, Switzerland North, UK South4,096Sep 2021
gpt-35-turbo-16k (0613)Australia East, Canada East, East US, East US 2, France Central, Japan East, North Central US, Sweden Central, Switzerland North, UK South16,384Sep 2021
gpt-35-turbo-instruct (0914)East US, Sweden Central4,097Sep 2021
gpt-35-turbo (1106)Australia East, Canada East, France Central, South India, Sweden Central, UK South, West USInput: 16,385 Output: 4,096Sep 2021
1 This model will accept requests > 4,096 tokens. It is not recommended to exceed the 4,096 input token limit as the newer version of the model are capped at 4,096 tokens. If you encounter issues when exceeding 4,096 input tokens with this model this configuration is not officially supported.
Model Specifications
Last UpdatedSeptember 2024
PublisherOpenAI