Context WindowHow far a model can attend within a sequence—local windows, sparse reach, or full dense attention over prior tokens.InferenceSearch this tagcontext-windowOpen search entry pageModuleSliding-Window AttentionAn attention variant that restricts each query to a fixed local window of key positions instead of the full sequence.