Technically
AI Reference
Your dictionary for AI terms like LLM and RLHF
Company Breakdowns
What technical products actually do and why the companies that make them are valuable
Learning Tracks
In-depth, networked guides to learning specific concepts
Posts Archive
All Technically posts on software concepts since the dawn of time
Terms Universe
The dictionary of software terms you've always wanted

Explore learning tracks

AI, it's not that ComplicatedAnalyzing Software CompaniesBuilding Software ProductsWorking with Data Teams
Loading...
I'm feeling luckyPricing
Log In
← Back to Universe

Pre-training

aiintermediate

When training an LLM, pre-training gives the model all the basic, foundational knowledge it needs to answer your prompts, by showing it lots and lots of examples of existing text from the internet.

We call this process the sentence re-arranging game. Take any sentence that exists in text, remove a word, and boom! You've got labeled training data. This is how LLMs are trained: by taking tons and tons of publicly available sentences, removing words, and teaching the model to replace them correctly:

  • I bought a stereo system to play my ______.
  • I bought a ______ to play my music.

After pre-training, a model is still not quite ready for prime time: responses will be long and rambly, and may not actually answer your question. This is why pre-training is followed by post-training, to refine the model into something we'd recognize as usable.

Read the full post ↗

How do you train an AI model?

A deep dive into how models like ChatGPT get built.

Read in the Knowledge Base →

Mentioned in

How do you train an AI model?

A deep dive into how models like ChatGPT get built.

Appliedai

Related terms

ChatGPT

LLM

Loss Function

Machine Learning

Post-training

Training

Impress your agents

70K+ PMs, engineers, investors, and operators read to Technically to expand their prompting vocabulary.

Content
  • All Posts
  • Learning Tracks
  • AI Reference
  • Companies
  • Terms Universe
Company
  • Pricing
  • Sponsorships
  • Contribute
  • Contact
Connect
SubscribeSubstackYouTubeXLinkedIn
Legal
  • Privacy Policy
  • Terms of Service

© 2026 Technically.