AImpact
Category: Training · Level: Beginner · Also known as: Pre-training, Pre-addestramento

Pretraining

The initial training phase, in which a model learns the statistical structure of language by predicting the next token across very large amounts of general-purpose text.
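To make "predicting the next token" concrete, here is a minimal sketch, not how real models are built. It assumes a toy bigram counter in place of a neural network and a nine-word corpus in place of web-scale text; the function names (`train_bigram`, `next_token_probs`, `average_loss`) are hypothetical. The idea it illustrates is the same: the model is scored by how much probability it gives to the token that actually comes next.

```python
import math
from collections import Counter, defaultdict


def train_bigram(tokens):
    """Count how often each token follows each other token."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def next_token_probs(counts, prev):
    """Turn the raw follow-up counts for `prev` into probabilities."""
    total = sum(counts[prev].values())
    return {tok: c / total for tok, c in counts[prev].items()}


def average_loss(counts, tokens):
    """Average negative log-likelihood of each actual next token.

    Lower is better. Real pretraining minimizes this same kind of
    quantity, using a neural network instead of a count table.
    """
    losses = []
    for prev, nxt in zip(tokens, tokens[1:]):
        p = next_token_probs(counts, prev).get(nxt, 0.0)
        losses.append(-math.log(p) if p > 0 else float("inf"))
    return sum(losses) / len(losses)


# Toy corpus (an assumption for illustration, split on whitespace).
corpus = "the cat sat on the mat the cat ate".split()
model = train_bigram(corpus)
probs = next_token_probs(model, "the")
loss = average_loss(model, corpus)
```

Here `probs` gives "cat" twice the probability of "mat" after "the", because that is what the corpus shows. A real pretrained model generalizes far beyond such counts, but the training signal has the same shape.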


In practice

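Pretraining is the most expensive step in building a language model: a large run can occupy GPU clusters for weeks or months and cost millions of dollars. The result is a "base" model that can continue text fluently but does not yet reliably follow instructions. Only large labs typically pretrain from scratch; most companies start from an existing pretrained model and adapt it with supervised fine-tuning (SFT), parameter-efficient methods such as LoRA, or reinforcement learning from human feedback (RLHF).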
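A rough sense of why the cost is so high can be had from a back-of-the-envelope estimate. The sketch below assumes the commonly used rule of thumb that training a dense transformer takes about 6 × parameters × tokens floating-point operations; that approximation is not from this entry. The hardware figures passed in (peak throughput, utilization, GPU count) are hypothetical placeholders, not measurements of any real system.

```python
SECONDS_PER_DAY = 86_400


def training_flops(n_params, n_tokens):
    """Approximate training compute using the ~6 * N * D rule of thumb."""
    return 6 * n_params * n_tokens


def training_days(flops, peak_flops_per_gpu, utilization, n_gpus):
    """Wall-clock days, given hypothetical hardware assumptions.

    peak_flops_per_gpu: advertised FLOP/s of one accelerator (assumed)
    utilization: fraction of peak actually achieved, between 0 and 1 (assumed)
    n_gpus: number of accelerators running in parallel (assumed)
    """
    effective_flops_per_second = peak_flops_per_gpu * utilization * n_gpus
    return flops / effective_flops_per_second / SECONDS_PER_DAY


# Illustrative inputs only: a 7-billion-parameter model on 1 trillion tokens.
flops = training_flops(7e9, 1e12)
days = training_days(flops, peak_flops_per_gpu=1e15, utilization=0.4, n_gpus=1000)
```

Changing any input changes the answer proportionally, which is the point: larger models and more tokens multiply the compute, and the bill follows.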
