GPT Models

GPT Models

The OpenAI API offers a variety of models, each with unique capabilities and varying price points, catering to different needs. Additionally, these models can be customized for specific use cases through fine-tuning.

OpenAI is excited to announce the preview release of GPT-4 Turbo, which boasts a 128k context window, and an updated GPT-3.5 Turbo, featuring a 16k context window. These models are enhanced with improved instruction following, a JSON mode, more reproducible outputs, and the capability for parallel function calling.

MODELDESCRIPTION
GPT-5GPT-5 Is under training as of today.
GPT-4 and GPT-4 TurboA set of models that improve on GPT-3.5 and can understand as well as generate natural language or code
GPT-3.5A set of models that improve on GPT-3 and can understand as well as generate natural language or code
GPT baseA set of models without instruction following that can understand as well as generate natural language or code

Continuous GPT Models Upgrades

OpenAI consistently upgrades its models to enhance performance and capabilities. Dev Team ensures that designations such as gpt-3.5-turbo, gpt-4, and gpt-4-32k always refer to the latest version of the respective models. The exact version used in a request can be confirmed by examining the response object, which includes details like ‘gpt-3.5-turbo-0613‘.

In addition to these continuously updated models, OpenAI also offers static model versions. These versions remain available for at least three months post the introduction of an updated model, providing developers with consistent performance during this period.

OpenAI is also introducing a new approach to model updates, inviting contributions from users through evals. These contributions are intended to help refine the models for various use cases. Interested individuals can participate by accessing the OpenAI Evals repository.

For users who wish to stay current with the latest advancements, OpenAI recommends using standard model names like gpt-4 or gpt-3.5-turbo. Meanwhile, temporary snapshot models, which have predetermined deprecation dates and their respective replacements, are also available for use until their deprecation.

MODEL NAMEDISCONTINUATION DATEREPLACEMENT MODEL
gpt-3.5-turbo-0613Jun 13, 2024gpt-3.5-turbo-1106
gpt-3.5-turbo-0301Jun 13, 2024gpt-3.5-turbo-1106

Learn more about model deprecation on our deprecation page.

GPT-5

GPT-5 Model is under training as of today.

GPT-4 and GPT-4 Turbo

GPT-4 is a large multimodal model capable of processing both text and image inputs and delivering text outputs. This model outperforms all previous models in terms of accuracy, attributed to its expansive general knowledge base and enhanced reasoning abilities. GPT-4 is accessible through the OpenAI API for paying customers.

Similar to gpt-3.5-turbo, GPT-4 is optimized for conversational applications but also excels in traditional completion tasks when used via the Chat Completions API. Users can learn more about utilizing GPT-4 effectively by referring to the comprehensive GPT guide provided by OpenAI.

MODELDESCRIPTIONCONTEXT WINDOWTRAINING DATA
gpt-4-1106-previewNew GPT-4 Turbo
The latest GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Returns a maximum of 4,096 output tokens. This preview model is not yet suited for production traffic. Learn more.
128,000 tokensUp to Apr 2023
gpt-4-vision-previewNew GPT-4 Turbo with vision
Ability to understand images, in addition to all other GPT-4 Turbo capabilties. Returns a maximum of 4,096 output tokens. This is a preview model version and not suited yet for production traffic. Learn more.
128,000 tokensUp to Apr 2023
gpt-4Currently points to gpt-4-0613. See continuous model upgrades.8,192 tokensUp to Sep 2021
gpt-4-32kCurrently points to gpt-4-32k-0613. See continuous model upgrades.32,768 tokensUp to Sep 2021
gpt-4-0613Snapshot of gpt-4 from June 13th 2023 with improved function calling support.8,192 tokensUp to Sep 2021
gpt-4-32k-0613Snapshot of gpt-4-32k from June 13th 2023 with improved function calling support.32,768 tokensUp to Sep 2021

While GPT-4 and GPT-3.5 models may perform similarly on many basic tasks, GPT-4 demonstrates a marked improvement in more complex reasoning scenarios. It significantly outperforms all previous models in these situations, showcasing its advanced capabilities in handling intricate and demanding tasks.

Multilingual capabilities

GPT-4 represents a significant leap forward in the field of large language models, surpassing both its predecessors and most state-of-the-art systems as of 2023. These competing systems often rely on benchmark-specific training or meticulous hand-engineering. In the context of the MMLU benchmark, an English-language suite comprised of multiple-choice questions across 57 subjects, GPT-4’s superiority is evident.

It not only considerably outperforms existing models in English but also exhibits robust performance in various other languages, highlighting its versatility and advanced language processing capabilities.

GPT-3.5 Models

The GPT-3.5 models are designed to understand and generate both natural language and code effectively. Within the GPT-3.5 family, the most capable and cost-effective model is gpt-3.5-turbo. This model has been specially optimized for chat-based applications using the Chat Completions API. However, it also performs admirably in traditional completion tasks, showcasing its versatility and efficiency in various language processing scenarios.

MODELDESCRIPTIONCONTEXT WINDOWTRAINING DATA
gpt-3.5-turbo-1106New Updated GPT 3.5 Turbo
The latest GPT-3.5 Turbo model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Returns a maximum of 4,096 output tokens. Learn more.
16,385 tokensUp to Sep 2021
gpt-3.5-turboCurrently points to gpt-3.5-turbo-0613.4,096 tokensUp to Sep 2021
gpt-3.5-turbo-16kCurrently points to gpt-3.5-turbo-0613.16,385 tokensUp to Sep 2021
gpt-3.5-turbo-instructSimilar capabilities as GPT-3 era models. Compatible with legacy Completions endpoint and not Chat Completions.4,096 tokensUp to Sep 2021
gpt-3.5-turbo-0613Legacy Snapshot of gpt-3.5-turbofrom June 13th 2023. Will be deprecated on June 13, 2024.4,096 tokensUp to Sep 2021
gpt-3.5-turbo-16k-0613Legacy Snapshot of gpt-3.5-16k-turbo from June 13th 2023. Will be deprecated on June 13, 2024.16,385 tokensUp to Sep 2021
gpt-3.5-turbo-0301Legacy Snapshot of gpt-3.5-turbofrom March 1st 2023. Will be deprecated on June 13th 2024.4,096 tokensUp to Sep 2021

We recommend using gpt-3.5-turbo over the other GPT-3.5 models because of its lower cost and improved performance.

GPT Base

GPT base models are adept at understanding and generating both natural language and code. However, unlike some of their advanced counterparts, they are not specifically trained for instruction following. These base models serve as replacements for the original GPT-3 base models and are designed to work with the legacy Completions API.

For most users, it is recommended to opt for the GPT-3.5 or GPT-4 models. These newer models offer enhanced capabilities, including improved instruction following, making them more suitable for a wider range of applications and more complex tasks.

GPT Store

More than 3 millions GPTs are available in GPT Store.

Read more articles about GPT Models in our Blog.