Models Beginner Also known as: SLM · Small LLM

Small Language Model

A Small Language Model (SLM) is a language model with roughly 1B to 7B parameters (there is no formal cutoff, and some models below 1B are also counted), designed to maximize quality per parameter rather than raw capability. A key insight from Microsoft's Phi series is that training on 'textbook quality' data, much of it synthetic, lets a 1.3B model such as Phi-1.5 approach the scores of considerably larger models on some reasoning benchmarks. Quantized SLMs can run on laptops, smartphones, and some embedded devices without a dedicated GPU. Representative examples include Phi-1.5, Phi-3, Gemma 2B, Qwen 1.5B, and SmolLM.


In practice

A developer chooses an SLM when deploying an AI assistant on edge hardware (Raspberry Pi, Android phone, corporate laptop) where a 70B LLM would be impractical. With llama.cpp or Ollama, a 4-bit quantized Phi-3 Mini runs at usable speed on most modern laptop CPUs, provided there is enough RAM to hold the weights. SLMs are also well suited to specialized tasks: fine-tuning on a narrow domain, even with limited data, can produce a compact model that matches or beats much larger general-purpose models such as GPT-4 on that specific task, though usually not outside it.
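To make the hardware argument concrete, here is a rough back-of-the-envelope sketch of weight memory at different precisions. It assumes memory is simply parameters × bits per weight, and the helper names (`weight_memory_gb`, `footprint_table`) are illustrative, not part of any library. Real 4-bit formats store extra scaling data, and the KV cache and activations add more on top, so treat these numbers as lower bounds.

```python
# Rough estimate of the memory needed to hold model weights only.
# Assumption: size = parameters * bits_per_weight / 8; real quantized
# files are somewhat larger, and runtime memory is larger still.

def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight memory in gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# Parameter counts: Phi-3 Mini has 3.8B; "70B LLM" is the generic
# large model mentioned in the text.
MODELS = {
    "Phi-3 Mini (3.8B)": 3.8e9,
    "70B LLM": 70e9,
}


def footprint_table(models, bit_widths=(16, 4)):
    """Return one (name, size_at_each_bit_width...) row per model."""
    rows = []
    for name, n_params in models.items():
        sizes = [round(weight_memory_gb(n_params, b), 1) for b in bit_widths]
        rows.append((name, *sizes))
    return rows


if __name__ == "__main__":
    print(f"{'model':<20}{'16-bit (GB)':>12}{'4-bit (GB)':>12}")
    for name, fp16, q4 in footprint_table(MODELS):
        print(f"{name:<20}{fp16:>12}{q4:>12}")
```

Under these assumptions, a 4-bit Phi-3 Mini needs about 2 GB for its weights, which fits in the RAM of an ordinary laptop or a recent phone, while a 70B model needs about 35 GB even at 4 bits.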

