Skip to content
AImpact
IT EN
Models Advanced Also known as: Rotary Position Embedding · Embedding posizionale rotatorio

RoPE

/rope/

A positional encoding technique that rotates token vectors based on their position, baking order directly into attention.

ShareLinkedInX

In practice

It has become the de facto standard: Llama, Mistral, Qwen, DeepSeek, and GPT-4 class models all use it. It lets you stretch the context beyond the training length with tricks like NTK-aware or YaRN. If you fine-tune on long contexts, understanding RoPE is almost mandatory.

Related terms

← All terms