Skip to content
AImpact
IT EN
Models Beginner Also known as: Attenzione · Self-attention

Attention

A mechanism that lets the model weigh how relevant each word in the text is compared to the others to understand the meaning of the context.

ShareLinkedInX

In practice

It is why an LLM knows that 'he' in a sentence refers to a person mentioned earlier. Compute cost grows with the square of context length: this is why very long contexts are expensive.

Related terms

← All terms