Training Intermediate Also known as: Distillazione

Distillation

A technique to train a small model to mimic the behavior of a large one, getting similar quality at a fraction of inference cost.

ShareLinkedIn X

In practice

It is why we keep getting small but capable models: they are distilled from frontier ones. If you need fast cheap responses in a narrow domain, distilling your own model from Claude or GPT is often the winning move.

Related terms

Fine-tuning Quantization

Seen in the wild

3 entries mentioning it

February 1, 2025

s1: 1000 examples and a prompt trick to replicate a reasoning model

High
June 27, 2024

Gemma 2: Google's second-gen open model with Gemini distillation

High
October 25, 2023

Latent Consistency Models: real-time image generation in 4 steps

High

← All terms