Allow subselecting the appropriate config for llama4 #1815

Merged: 20 commits into mosaicml:main on May 5, 2025

Conversation

@dakinggg (Collaborator) commented May 3, 2025

Multimodal models may return a config with subconfigs from AutoConfig. This PR allows subclasses to automatically select a subconfig, and sets up the causal LM training class to select the text config. With this, Llama 4 Scout finetuning works (llama4-scout-hf-real-7-a3JTUJ), and Llama 4 also works with flex attention (llama4-scout-hf-flex-1-EUH7iy).
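
As a minimal sketch of what subconfig selection looks like (an illustration, not the PR's actual implementation; the `text_config` attribute follows the Hugging Face convention for multimodal configs, and the model id is assumed for the example):

```python
from transformers import AutoConfig

# Multimodal configs such as Llama4Config wrap per-modality subconfigs
# (text_config, vision_config). The model id below is illustrative and
# requires a transformers version with Llama 4 support plus hub access.
config = AutoConfig.from_pretrained("meta-llama/Llama-4-Scout-17B-16E")

# Causal LM training only needs the text portion; fall back to the
# top-level config for text-only models that have no subconfigs.
text_config = getattr(config, "text_config", config)
print(type(config).__name__, "->", type(text_config).__name__)
```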

Along for the ride, this installs hf_xet, the new Hugging Face extra that reportedly speeds up downloads, and bumps the minimum huggingface_hub version to one that supports it.
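
Pulling in the extra would look something like this (the extra name comes from the PR description; the exact version bound is an assumption):

```
pip install "huggingface_hub[hf_xet]>=0.30.0"
```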

@dakinggg dakinggg marked this pull request as ready for review May 3, 2025 04:14
@dakinggg dakinggg requested review from a team as code owners May 3, 2025 04:14
@dakinggg dakinggg changed the title from "Allow subselecting the appropriate config" to "Allow subselecting the appropriate config for llama4" May 3, 2025

@bowenyang008 (Contributor) left a comment

LGTM

@irenedea (Contributor) left a comment

couple of nits! lgtm

@dakinggg dakinggg enabled auto-merge (squash) May 5, 2025 20:35
@dakinggg dakinggg merged commit 5c47350 into mosaicml:main May 5, 2025
11 checks passed