Inference Intermediate Also known as: NIAH · Ago nel pagliaio

Needle in a Haystack

A test that hides a specific sentence inside a long irrelevant text and asks the model to retrieve it, to measure the real quality of the context window.

ShareLinkedIn X

In practice

It has become the de facto benchmark for long-context models (100K, 1M tokens). A model can advertise a huge context but fail NIAH beyond a certain depth, a sign the window is effectively 'fake'.

Related terms

Context window Lost in the middle

Seen in the wild

0 entries mentioning it

No archive entry mentions it explicitly. Appears in broader contexts.

← All terms