Training · Beginner
Also known as: Affinamento · Adattamento (Italian for "refinement" and "adaptation")

Fine-tuning

An additional training stage in which an already pretrained model is trained further on a smaller, more specific dataset to improve its performance on a particular task or domain.

In practice

You fine-tune when the base model does not match the style, jargon, or output formats you need. It requires high-quality labeled data and GPU compute. A common approach is to start with a lightweight, parameter-efficient variant such as LoRA before committing to a full fine-tune.
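
To make "lightweight" concrete, here is a rough sketch of why LoRA is cheaper than a full fine-tune. LoRA keeps the pretrained weight matrix frozen and learns two small low-rank matrices whose product is added to it, so only those two matrices are trained. The layer size (4096 x 4096) and the rank (8) below are illustrative assumptions, not values taken from any particular model, and the helper names are hypothetical.

```python
def full_finetune_params(d_out: int, d_in: int) -> int:
    """Trainable weights when the whole d_out x d_in matrix is updated."""
    return d_out * d_in


def lora_params(d_out: int, d_in: int, rank: int) -> int:
    """Trainable weights with LoRA: B is d_out x rank, A is rank x d_in."""
    return rank * (d_out + d_in)


# Illustrative (assumed) sizes for a single weight matrix.
d_out, d_in, rank = 4096, 4096, 8

full = full_finetune_params(d_out, d_in)
lora = lora_params(d_out, d_in, rank)
ratio = lora / full

print(f"full fine-tune: {full:,} trainable weights")
print(f"LoRA (rank {rank}): {lora:,} trainable weights")
print(f"LoRA trains {ratio:.2%} of the matrix")
```

Under these assumptions LoRA trains well under 1% of the weights in that matrix, which is why it is often tried first. The saving shrinks as the rank grows, so the rank is a trade-off between cost and how much the adapter can change the model.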

Related terms

LoRA · Foundation model · RLHF

Seen in the wild

11 entries mentioning it

August 20, 2024 · bitsandbytes 0.43: QLoRA and NF4/FP4 quantization for 4-bit fine-tuning · Medium

July 16, 2024 · Databricks Mosaic AI: unified fine-tuning and inference on the data lakehouse · Medium

March 18, 2024 · S-LoRA and Punica: serving hundreds of LoRA fine-tunings from a single base model · High

September 14, 2023 · Backdoors in fine-tuned LLMs: hidden behaviors activatable on command · High

June 14, 2023 · WizardLM: GPT-4-evolved instructions for fine-tuning · Medium

June 5, 2023 · Gorilla: fine-tuned LLaMA that calls APIs without errors · Medium

April 16, 2023 · Vicuna-13B: the open chatbot that reaches 90% of ChatGPT quality · High

October 25, 2022 · Textual Inversion: inject a custom concept into diffusion models · Medium

August 25, 2022 · DreamBooth: generate your subject in any style with 3-5 photos · High

January 27, 2022 · InstructGPT: the fine-tuning that teaches GPT to obey · High

October 21, 2021 · FLAN: instruction tuning that teaches models to follow directions · High
