Technically

What is Generative AI?

A refresher on how models like ChatGPT and Stable Diffusion work under the hood.

Published Jul 22, 2025 · ai
Justin Gage
Read within learning track: AI, it's not that complicated

If you’re reading this, you probably already have a decent grasp on what run-of-the-mill Machine Learning is. But how do you go from a model that does something tractable, like predicting a stock price or identifying bugs in a cornfield, to a model that creates words from scratch? Or in other words…what is a Generative AI model actually doing?

First, I’m going to run through how GenAI works in general and how models create things. Then I’ll cover the two most popular types of GenAI – image generation and text generation – in a bit more depth.

What is a Generative AI model doing?

Old school Machine Learning was all about prediction. You’d train a model on a dataset, and use it to predict what’s going to happen when new data enters the fray, like a new day in the stock market or a new image of some corn in your field.
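
To make "train on a dataset, then predict on new data" concrete, here is a minimal sketch in plain Python. It fits a straight line to a handful of made-up prices and predicts the next day. The data, the function names `fit_line` and `predict`, and the choice of a straight-line model are all assumptions for illustration; real stock prediction is far messier.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept with ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(model, x):
    """Use the learned pattern to predict a single number for new input."""
    slope, intercept = model
    return slope * x + intercept


# Made-up training data: day number -> closing price.
days = [1, 2, 3, 4, 5]
prices = [10.0, 12.0, 14.0, 16.0, 18.0]

model = fit_line(days, prices)   # "training"
next_price = predict(model, 6)   # prediction for a day the model has not seen
print(next_price)
```

The output is one highly scoped thing, a single number, which is the hallmark of this older style of model.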

Generative AI is also all about prediction. But instead of predicting a highly scoped, specific thing – like a number, or a yes/no answer – it predicts entire sentences, paragraphs, images, videos, or even audio. It’s trained to generate entire swaths of new data based on your prompts.

So even though the techniques and style have changed quite a bit over the past 5 years, GenAI and old school ML aren’t so far apart; they’re both learning patterns in data, and then using those patterns to do something.
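
One way to picture "generation is still prediction" is a toy model that predicts one next word at a time and keeps feeding its own output back in. The sketch below is an assumption-heavy illustration, not how ChatGPT works: it uses a simple table of word-pair counts instead of a neural network, always picks the most common next word instead of sampling, and `train_bigram` and `generate` are hypothetical names.

```python
from collections import Counter, defaultdict


def train_bigram(text):
    """Learn the pattern: for each word, count which words follow it."""
    words = text.split()
    counts = defaultdict(Counter)
    for current, following in zip(words, words[1:]):
        counts[current][following] += 1
    return counts


def generate(counts, prompt, max_words=5):
    """Extend the prompt by repeatedly predicting the next word."""
    output = prompt.split()
    for _ in range(max_words):
        followers = counts.get(output[-1])
        if not followers:
            break  # the model never saw anything after this word
        next_word = followers.most_common(1)[0][0]
        output.append(next_word)
    return " ".join(output)


# Made-up training text.
corpus = "the corn grows tall and the corn grows fast and the corn feeds us"

model = train_bigram(corpus)
generated = generate(model, "the", max_words=2)
print(generated)
```

Each step is an ordinary prediction, like the old-school kind. Chaining many of them together is what turns a predictor into a generator.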

Types of GenAI models

There are a bunch of different types of GenAI models, some of which have been around for a while. You’ve got Generative Adversarial Networks (GANs), where one model creates something and a sister model critiques it. You’ve got Variational Autoencoders (VAEs), which learn a probability distribution over the data and sample from it to create new examples. Then you have Recurrent Neural Networks (RNNs), which are a special type of neural network that predicts sequences, like sequences of words. There are even more with even longer acronyms.

But most of the advances over the past few years have come from two specific types of GenAI models:

  1. Transformers – mostly for text generation
  2. Diffusion models – mostly for image and video generation

Like any software system, each has things that it’s good at and things that it’s less good at. Let’s run through each type of model, where they came from, and how they work. In practice, many state-of-the-art models today use some combination of both.

Text generation models and transformers

Most of what’s wowing people today in GenAI is LLMs, or Large Language Models. ChatGPT, Claude, Gemini, and the like are all based on a specific type of model called a transformer.
