{"type":"link","version":"1.0","title":"Multi-resolution attention reaches the whole input in a single layer, where shifted-window attention needs many stacked layers to span the same distance","author_name":"AI Archs","author_url":"https://ai-arch.pages.dev","provider_name":"AI Archs","provider_url":"https://ai-arch.pages.dev","url":"https://ai-arch.pages.dev/n/global-receptive-field-every-layer-beats-windowed-attention-stacking","thumbnail_url":"https://ai-arch.pages.dev/android-chrome-512x512.png","thumbnail_width":512,"thumbnail_height":512}