[Model] Pixtral Support #253
base: main
Conversation
Pixtral isn't yet fully supported in the transformers library. PR pending release of Pixtral in the transformers package.
Exciting! @AndreSlavescu It seems it is now supported in transformers: https://github.com/huggingface/transformers/tree/main/src/transformers/models/pixtral
Yes, I'll try to finish this either today or tomorrow.
@Tcc0403 pinging in case you'd like to take a look.
I'm not familiar with Pixtral, but it looks like it's just a base model. The loss isn't computed in the forward pass, so there's no need to patch CrossEntropy and FusedLinearCrossEntropy.
```python
else:
    output = model(**batch)
    loss = output.loss
loss.backward()
```
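For context, here is a minimal sketch of why the generic `output.loss` branch above doesn't apply to Pixtral (the tiny config values and input shape are illustrative only, and the exact `pixel_values` format can vary across transformers versions):

```python
import torch
from transformers import PixtralVisionConfig, PixtralVisionModel

# A deliberately tiny config so the sketch runs quickly; real values differ.
config = PixtralVisionConfig(
    hidden_size=64,
    intermediate_size=128,
    num_hidden_layers=2,
    num_attention_heads=4,
    image_size=64,
    patch_size=16,
)
model = PixtralVisionModel(config)

pixel_values = torch.randn(1, 3, 64, 64)  # dummy image batch
output = model(pixel_values=pixel_values)

# PixtralVisionModel is a base model: it returns hidden states, not a loss,
# so the test harness has to compute a loss itself for this model.
print(output.last_hidden_state.shape)  # (1, num_patches, hidden_size)
```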
Actually, if we have to generate pixel-value inputs just for this specific vision model, do we really want to support pure vision models in Liger Kernel? cc @lancerts @shivam15s @yundai424
If the answer is yes, then I think we should make another convergence test file for vision models that follows this type of workflow: generating pixel inputs and applying a custom loss function (see the sketch below).
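For example, such a test could generate deterministic pixel inputs along these lines (a sketch only; the function name and shape defaults are mine, not from this PR):

```python
import torch

def generate_dummy_pixel_batch(
    batch_size: int = 2,
    num_channels: int = 3,
    height: int = 64,
    width: int = 64,
    seed: int = 42,
) -> dict:
    """Build a reproducible random image batch for a vision convergence test."""
    generator = torch.Generator().manual_seed(seed)
    pixel_values = torch.randn(
        batch_size, num_channels, height, width, generator=generator
    )
    return {"pixel_values": pixel_values}
```

Seeding the generator keeps the Liger and Hugging Face runs comparable step for step.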
Yes, I was thinking of implementing a custom loss function for this, because patching with FusedLinearCrossEntropy won't work here with the current API.
And yes, the main difficulty with integrating this into the current mini model tests is that they expect pixel inputs to the constructor of the PixtralVisionModel, so I have done a hacky solution for now.
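Roughly what such a custom loss could look like (a sketch under my own assumptions: the mean pooling and the `head` projection are hypothetical placeholders, not this PR's implementation):

```python
import torch
import torch.nn.functional as F

def vision_model_loss(model, pixel_values, labels, head):
    """Cross-entropy loss on top of a base vision model's features.

    `head` is a hypothetical nn.Linear mapping pooled hidden states to class
    logits; PixtralVisionModel itself ships no loss head.
    """
    hidden = model(pixel_values=pixel_values).last_hidden_state  # (B, patches, H)
    pooled = hidden.mean(dim=1)  # simple mean pooling over patches
    logits = head(pooled)        # (B, num_classes)
    return F.cross_entropy(logits, labels)
```

The cross-entropy call is the piece that could later be swapped for a Liger kernel once the API supports this path.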
Yeah, making the model input a fixture (or similar) and the loss function customizable as well (all configured in the mini model config) is a good idea 🤔
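Something along these lines, perhaps (field names are made up for illustration; the repo's actual mini model config differs):

```python
from dataclasses import dataclass
from typing import Callable, Optional

import torch

@dataclass
class VisionMiniModelConfig:
    """Hypothetical mini model config entry extended for vision models."""
    model_class: type                # e.g. PixtralVisionModel
    model_config: object             # the matching transformers config
    data_gen_fn: Callable[[], dict]  # e.g. random pixel-value batches
    loss_fn: Optional[Callable[..., torch.Tensor]] = None  # custom loss, if any
```

The harness would then call `data_gen_fn()` for inputs and fall back to `output.loss` whenever `loss_fn` is None.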
Summary
This PR aims to support Pixtral.
Testing Done
Tested the model and the monkey patch.
- `make test` to ensure correctness
- `make checkstyle` to ensure code style
- `make test-convergence` to ensure convergence