In practice
You do it when the base model does not match the style, jargon, or formats you need. It requires good labeled data and GPUs. Often you start with a lightweight variant like LoRA before doing a full fine-tune.
Related terms
Seen in the wild
11 entries mentioning it- Mediumbitsandbytes 0.43: QLoRA and NF4/FP4 quantization for 4-bit fine-tuning
- MediumDatabricks Mosaic AI: unified fine-tuning and inference on the data lakehouse
- HighS-LoRA and Punica: serving hundreds of LoRA fine-tunings from a single base model
- HighBackdoors in fine-tuned LLMs: hidden behaviors activatable on command
- MediumWizardLM: GPT-4-evolved instructions for fine-tuning
- MediumGorilla: fine-tuned LLaMA that calls APIs without errors
- HighVicuna-13B: the open chatbot that reaches 90% of ChatGPT quality
- MediumTextual Inversion: inject a custom concept into diffusion models
- HighDreamBooth: generate your subject in any style with 3-5 photos
- HighInstructGPT: the fine-tuning that teaches GPT to obey
- HighFLAN: instruction tuning that teaches models to follow directions