Update DPO Max Seq Len #2176

Merged: 2 commits into pytorch:main on Dec 20, 2024

Conversation

pbontrager (Contributor)

Context

What is the purpose of this PR? Is it to

  • add a new feature
  • fix a bug
  • update tests and/or documentation
  • other (please add here)

The default value for the Llama 3.1 8B configs is updated to match Llama 2. Without truncation, the dataset's long sequences cause OOMs.
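
For intuition, max_seq_len: null means tokenized samples pass through at full length; a minimal sketch of the truncation semantics (illustrative only, not torchtune's actual implementation):

def truncate(tokens: list[int], max_seq_len: int | None) -> list[int]:
    # max_seq_len=None (YAML: null) keeps the full sequence, so a single
    # 7K+-token DPO pair can exhaust GPU memory; a finite cap bounds it.
    return tokens if max_seq_len is None else tokens[:max_seq_len]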

Changelog

  • Set max_seq_len in the llama3.1 8B DPO LoRA configs

Test plan

Please make sure to do each of the following if applicable to your PR. If you're unsure about any of these, just ask and we will happily help. We also have a contributing page for some guidance on contributing.

  • run pre-commit hooks and linters (make sure you've first installed via pre-commit install)
  • add unit tests for any new functionality
  • update docstrings for any new or updated methods or classes
  • run unit tests via pytest tests
  • run recipe tests via pytest tests -m integration_test
  • manually run any new or modified recipes with sufficient proof of correctness
  • include relevant commands and any other artifacts in this summary (pastes of loss curves, eval results, etc.)

tune run lora_dpo_single_device --config llama3_1/8B_lora_dpo_single_device
tune run --nproc_per_node 2 lora_dpo_distributed --config llama3_1/8B_lora_dpo
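
If the default cap needs adjusting for a particular machine, torchtune's key=value override syntax avoids editing the YAML; the value below is illustrative, not the one set in this PR:

tune run lora_dpo_single_device --config llama3_1/8B_lora_dpo_single_device tokenizer.max_seq_len=2048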

pytorch-bot bot commented Dec 18, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchtune/2176

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit d04db5c with merge base 3518492:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot added the CLA Signed label (this label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) on Dec 18, 2024
@@ -31,7 +31,7 @@ model:
 tokenizer:
   _component_: torchtune.models.llama3.llama3_tokenizer
   path: /tmp/Meta-Llama-3.1-8B-Instruct/original/tokenizer.model
-  max_seq_len: null
Contributor commented:

Shouldn't this be like 8K? B/c the default dataset has an average seq len > 7K
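
One way to sanity-check that claim before picking a cap (a sketch; the tokenizer argument and encode signature are assumptions, not taken from this PR):

from statistics import mean

def avg_token_len(samples: list[str], tokenizer) -> float:
    # Mean tokenized length over a sample of the dataset; a cap far below this
    # truncates most pairs, while one far above it wastes memory headroom.
    return mean(len(tokenizer.encode(s, add_bos=True, add_eos=True)) for s in samples)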

felipemello1 merged commit 74e6e7b into pytorch:main on Dec 20, 2024
17 checks passed
felipemello1 pushed a commit that referenced this pull request Dec 20, 2024
mori360 pushed a commit to mori360/torchtune that referenced this pull request Dec 20, 2024
rahul-sarvam pushed a commit to sarvamai/torchtune that referenced this pull request Dec 23, 2024