Adding support for nope positional encoding in block overrides. #1794

Merged: 15 commits into mosaicml:main, Apr 29, 2025

Conversation

@ShashankMosaicML (Contributor) commented Apr 17, 2025

Allows certain layers to have no positional embedding (NoPE), similar to Llama 4. Example usage:

block_overrides:
  order:
  - name: default
  - name: nope_layer
  overrides:
    nope_layer:
      attn_config:
        nope: True
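
For orientation, here is a minimal sketch of how a named override could be merged onto the base attn_config; the helper name and merge logic are illustrative assumptions, not the resolution code in this PR:

import copy

# Hypothetical helper; the actual block_overrides resolution in
# llm-foundry may differ.
def apply_override(base_attn_config: dict, override: dict) -> dict:
    cfg = copy.deepcopy(base_attn_config)
    cfg.update(override.get('attn_config', {}))
    return cfg

base = {'rope': True, 'alibi': False, 'nope': False}
nope_layer = apply_override(base, {'attn_config': {'nope': True}})
assert nope_layer == {'rope': True, 'alibi': False, 'nope': True}

In the example above, the order list presumably alternates default and nope_layer blocks, so that roughly every other layer skips positional encoding.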

@ShashankMosaicML marked this pull request as ready for review April 24, 2025 01:25
@ShashankMosaicML requested a review from a team as a code owner April 24, 2025 01:25
@ShashankMosaicML changed the title from "Adding for nope positional encoding in block overrides." to "Adding support for nope positional encoding in block overrides." April 24, 2025
@dakinggg requested a review from Copilot April 25, 2025 17:05
@Copilot (Copilot AI) left a comment

Pull Request Overview

This PR introduces a new flag ("nope") to disable positional encoding via block overrides, providing users with greater flexibility in specifying positional encoding behavior.

  • Added tests to verify attention behavior with the "nope" flag.
  • Updated configuration defaults and validation in the MPT configuration to restrict "nope" usage to block overrides.
  • Propagated the "nope" flag through attention layer constructors and forward logic to conditionally disable rotary embeddings and alibi slopes (see the sketch below).
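
A minimal sketch of that forward-pass gating, using hypothetical names (rotary_emb, alibi_slopes) for the surrounding state; this is not the exact code in attention.py:

def apply_positional_encoding(q, k, rotary_emb, alibi_slopes, nope=False):
    # A layer overridden with nope=True skips both rotary embeddings
    # and alibi slopes entirely.
    if nope:
        return q, k, None
    if rotary_emb is not None:
        q, k = rotary_emb(q, k)
    return q, k, alibi_slopes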

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated no comments.

File | Description
tests/models/layers/test_flash_torch.py | Added tests covering cases with "nope" enabled and disabled.
llmfoundry/models/utils/config_defaults.py | Added default "nope": False setting in configuration defaults.
llmfoundry/models/mpt/configuration_mpt.py | Enforced "nope" usage only in block overrides with validation.
llmfoundry/models/layers/attention.py | Propagated "nope" flag across attention layer initialization and forward pass.
Comments suppressed due to low confidence (3)

llmfoundry/models/layers/attention.py:464

  • [nitpick] The parameter name 'nope' is not self-explanatory; consider renaming it to something more descriptive like 'disable_positional_encoding' to improve code clarity.
nope: bool = False,

llmfoundry/models/mpt/configuration_mpt.py:214

  • [nitpick] The raised error message when 'nope' is specified as a default indicates that it must only be used in block_overrides; please ensure this behavior is clearly documented for users.
if self.attn_config.get('nope', False):
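
For illustration, the validation could look roughly like the following; the function name and error message are assumptions, not the code under review:

def _validate_attn_config(attn_config: dict) -> None:
    # 'nope' may only be enabled per-layer via block_overrides, never
    # as a model-wide default.
    if attn_config.get('nope', False):
        raise ValueError(
            "'nope' must not be set in the default attn_config; "
            'enable it for specific layers via block_overrides.',
        )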

tests/models/layers/test_flash_torch.py:821

  • [nitpick] Consider adding an explicit assertion verifying that positional encoding is disabled when 'nope' is set to True, to ensure future changes do not unintentionally alter this behavior.
cfg.nope = True
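
One way to phrase such an assertion is a permutation-equivariance check, sketched here with a placeholder layer constructor (this is not the actual test code):

import torch

def check_position_agnostic(attn_layer, x: torch.Tensor) -> None:
    # Without positional encoding (and with no causal mask), self-
    # attention is permutation-equivariant: permuting the input
    # sequence permutes the output the same way.
    perm = torch.randperm(x.shape[1])
    out = attn_layer(x)
    out_perm = attn_layer(x[:, perm])
    assert torch.allclose(out[:, perm], out_perm, atol=1e-5)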

@dakinggg (Collaborator) left a comment

Please add a PR description with an example of how to use this

@ShashankMosaicML merged commit 1a8776c into mosaicml:main Apr 29, 2025
11 checks passed