HomeArchitectureGlossaryTags
Getting Started

Context Window

How far a model can attend within a sequence—local windows, sparse reach, or full dense attention over prior tokens.

Inference

Open search entry page

Module

  • Sliding-Window Attention

    An attention variant that restricts each query to a fixed local window of key positions instead of the full sequence.