GPT (Generative Pre-trained Transformer)

GPT, short for Generative Pre-trained Transformer, is the family of large language models from OpenAI. It's the family that brought LLMs into the mainstream: the public launch of ChatGPT in late 2022 (built on a GPT model) is what turned LLMs from a research curiosity into an everyday tool used by hundreds of millions of people.

💡 In one line: GPT is OpenAI's family of decoder-only LLMs, the models behind ChatGPT and the spark of the modern AI boom.

What is GPT?

GPT models are decoder-only Transformers (see the GPT subtopic in the Transformer series) trained with next-token prediction: given the text so far, predict the token that comes next. They generate text autoregressively, one token at a time, with each new token appended to the input before the next one is predicted. The newer versions are multimodal, handling images and audio alongside text. Each new generation has scaled up and added capabilities like deeper reasoning and tool use.
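To make "one token at a time" concrete, here is a minimal sketch of the autoregressive loop. It is a toy, not how GPT is implemented: a bigram count table stands in for the trained Transformer, whole words stand in for tokens, and the names train_bigram and generate are made up for this illustration. Only the shape of the loop (predict a next token, append it, repeat) carries over to a real model.

```python
from collections import Counter, defaultdict


def train_bigram(tokens):
    """Count which token follows which. A toy stand-in for a trained model."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def generate(counts, prompt, max_new_tokens=5):
    """Greedy autoregressive decoding: pick the likeliest next token, append, repeat."""
    out = list(prompt)
    for _ in range(max_new_tokens):
        # The toy model only looks at the last token; a Transformer
        # conditions on the whole sequence generated so far.
        followers = counts.get(out[-1])
        if not followers:
            break  # nothing ever followed this token in the training text
        next_token = followers.most_common(1)[0][0]
        out.append(next_token)
    return out


corpus = "the cat sat on the mat and the cat sat down".split()
model = train_bigram(corpus)
print(generate(model, ["the"], max_new_tokens=2))  # ['the', 'cat', 'sat']
```

A real GPT model replaces the count table with a neural network that outputs a probability for every token in its vocabulary, and it usually samples from that distribution rather than always taking the top choice.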

GPT vs. ChatGPT

A common point of confusion:

  • GPT = the underlying models (the AI).
  • ChatGPT = the assistant product that uses GPT models through a chat interface, with extra features like memory, tools, and search.

So GPT is the engine; ChatGPT is the car.

The GPT Lineage

GPT has evolved through several landmark generations:

  • GPT-1 (2018): the original proof of concept.
  • GPT-2 (2019): bigger, noticeably more fluent.
  • GPT-3 (2020): huge scale-up; strong few-shot learning.
  • ChatGPT / GPT-3.5 (2022): the mainstream breakthrough.
  • GPT-4 (2023): much stronger reasoning, multimodal (image input).
  • GPT-4o (2024): an "omni" model unifying text, audio, and image.
  • o-series (2024–25): separate reasoning models that "think before answering."
  • GPT-5 (2025): a unified system that pairs a fast model with a deeper reasoning model behind a router that decides how much to think.

As of mid-2026, the lineup is the GPT-5 generation (with rapid point releases like GPT-5.5 and a GPT-5.6 preview). The exact version names change often.

Key Characteristics (Current Generation)

  • Unified design: a fast path ("Instant") and a deeper reasoning path ("Thinking"), with a router choosing per request.
  • Tiers: typically an instant model, a reasoning model, and a top-end "Pro" variant.
  • Multimodal: text, images, and audio.
  • Large context windows and strong performance on coding, reasoning, and agentic tasks.
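The idea of a router choosing a path per request can be pictured with a small sketch. Everything in it is hypothetical: the signals OpenAI's router actually uses are not described here, and the hint phrases, the length threshold, and the names Route and route are invented purely to show the control flow of "inspect the request, then pick a path."

```python
from dataclasses import dataclass

# Hypothetical signals, chosen only for illustration.
REASONING_HINTS = ("prove", "step by step", "debug")


@dataclass
class Route:
    path: str    # "instant" or "thinking"
    reason: str  # why this path was chosen


def route(prompt: str, long_prompt_chars: int = 400) -> Route:
    """Pick a path for one request using simple, made-up heuristics."""
    text = prompt.lower()
    for hint in REASONING_HINTS:
        if hint in text:
            return Route("thinking", f"matched hint: {hint!r}")
    if len(prompt) > long_prompt_chars:
        return Route("thinking", "long prompt")
    return Route("instant", "no reasoning signal found")


print(route("What is the capital of France?").path)      # instant
print(route("Prove that sqrt(2) is irrational.").path)   # thinking
```

A production router would presumably rely on a learned model rather than keyword matching; the sketch only shows that the decision is made per request, before any answer is generated.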

What GPT is Known For

  • Mainstreaming LLMs via ChatGPT.
  • General-purpose strength across writing, Q&A, and analysis.
  • Coding (including OpenAI's Codex coding agent).
  • A broad ecosystem of related models (image generation, video, speech).

Open or Closed?

GPT's flagship models are proprietary: they are accessed through ChatGPT or the OpenAI API and cannot be downloaded. OpenAI has, however, released some open-weight models separately, such as GPT-2 and the gpt-oss models. (See the Open-source subtopic for how this contrasts with fully open families like Llama and DeepSeek.)

A Note on Versions

The GPT lineup changes rapidly: new versions, tiers, and capabilities arrive every few months. Treat specific version names as a snapshot; always check OpenAI's site for the current models.

Summary

  • GPT is OpenAI's family of decoder-only LLMs, behind ChatGPT.
  • It evolved from GPT-1 (2018) through GPT-3, GPT-4, and the GPT-5 generation (2025–26).
  • GPT = the models; ChatGPT = the product built on them.
  • The current generation unifies fast and reasoning models behind a router, and is multimodal.
  • Flagship GPT models are closed/proprietary (API + ChatGPT), and the lineup evolves quickly.