Skip to content

Pull requests: Future-House/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

GRPO unwrap fix
#4 opened Feb 6, 2025 by sidnarayanan Loading…
pulling in diff from trl-2747
#3 opened Feb 3, 2025 by whitead Loading…
Apply attention mask when computing logprobs
#2 opened Feb 2, 2025 by sidnarayanan Loading…
Decoupling generation and loss batch sizes
#1 opened Feb 1, 2025 by sidnarayanan Loading…
ProTip! Exclude everything labeled bug with -label:bug.