Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Fix masking of response tokens
#1718 opened Jun 9, 2024 by mertsayar8 Loading…
adds AOT
#1701 opened Jun 5, 2024 by imelnyk Loading…
Visual DPO
#1647 opened May 17, 2024 by qgallouedec Draft
Prototype Dataset Processor
#1646 opened May 16, 2024 by vwxyzjn Draft
[DRAFT] Vllm integration
#1628 opened May 7, 2024 by vwxyzjn Draft
Integrate f-divergence to DPO (Follow up)
#1610 opened May 1, 2024 by 1485840691 Loading…
Minimal examples
#1603 opened Apr 30, 2024 by vwxyzjn Draft
Added DataCollatorForMultiTurnCompletions
#1592 opened Apr 26, 2024 by AswanthManoj Loading…
[WIP] Unify Policy Trainers
#1586 opened Apr 25, 2024 by lapp0 Draft
4 tasks
Added Reward Backpropogation Support
#1585 opened Apr 25, 2024 by mihirp1998 Loading…
A pull request for POVIDTrainer
#1573 opened Apr 23, 2024 by gzcch Loading…
ProTip! Exclude everything labeled bug with -label:bug.