[BUG] Deepspeed and RLHF causes error #529

Closed as not planned
@pascal-pfeiffer

Description

🐛 Bug

Running the default RLHF experiment with DeepSpeed enabled fails with the following error:

Traceback (most recent call last):
  File ".../train_wave.py", line 106, in <module>
    run(cfg=cfg)
  File ".../train.py", line 948, in run
    val_loss, val_metric = train_function(
  File ".../train.py", line 484, in run_train_rlhf
    train_dataloader.sampler, "set_epoch"
AttributeError: 'DeepSpeedDataLoader' object has no attribute 'sampler'
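
A possible workaround, sketched under assumptions: the traceback suggests that `run_train_rlhf` reads `train_dataloader.sampler` directly, which works for a plain `torch.utils.data.DataLoader` but not for the DeepSpeed wrapper. The helpers below look the sampler up defensively instead. The names `find_sampler` and `set_epoch_if_supported` are hypothetical and not part of the project, and the `data_sampler` fallback is an assumption about the attribute the DeepSpeed loader exposes, so verify it against the installed DeepSpeed version.

```python
from typing import Any, Optional


def find_sampler(dataloader: Any) -> Optional[Any]:
    """Return the loader's sampler, or None if it does not expose one."""
    # `sampler` is the attribute on torch.utils.data.DataLoader.
    # `data_sampler` is an ASSUMPTION about the DeepSpeed wrapper.
    for name in ("sampler", "data_sampler"):
        sampler = getattr(dataloader, name, None)
        if sampler is not None:
            return sampler
    return None


def set_epoch_if_supported(dataloader: Any, epoch: int) -> bool:
    """Call sampler.set_epoch(epoch) when available; report whether it ran."""
    sampler = find_sampler(dataloader)
    if sampler is not None and hasattr(sampler, "set_epoch"):
        sampler.set_epoch(epoch)
        return True
    return False
```

Calling `set_epoch_if_supported(train_dataloader, epoch)` in place of the direct attribute access would avoid the `AttributeError`. If no sampler is found, the epoch is simply not set, which may change shuffling behavior across epochs.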
