Skip to content
AImpact
IT EN
Inference Intermediate Also known as: NIAH · Ago nel pagliaio

Needle in a Haystack

A test that hides a specific sentence inside a long irrelevant text and asks the model to retrieve it, to measure the real quality of the context window.

ShareLinkedInX

In practice

It has become the de facto benchmark for long-context models (100K, 1M tokens). A model can advertise a huge context but fail NIAH beyond a certain depth, a sign the window is effectively 'fake'.

Related terms

Seen in the wild

0 entries mentioning it

No archive entry mentions it explicitly. Appears in broader contexts.

← All terms