Pull requests: huggingface/trl

Pull requests list

fix: invalidate ZeRO-3 param coordinator trace in add_hooks
#4693 opened Dec 15, 2025 by roycho96
1 of 5 tasks
Fix KeyError with transformers 5.0.0+ where push_to_hub_token is removed
#4691 opened Dec 14, 2025 by Manodeepray
3 tasks done
Fix typos
#4690 opened Dec 14, 2025 by qgallouedec
feat: DeepSeek V3.2 Off-policy sequence masking
#4689 opened Dec 13, 2025 by casinca Draft
5 tasks
GKDTrainer: Fix return_outputs in Liger kernel path and update tests
#4688 opened Dec 13, 2025 by roycho96
2 of 5 tasks
Align stable trainers
#4687 opened Dec 12, 2025 by qgallouedec
5 tasks
Align GRPO and RLOO initialization
#4685 opened Dec 12, 2025 by qgallouedec
Align import utils with transformers
#4684 opened Dec 12, 2025 by qgallouedec
Move get_reward function to experimental.utils
#4683 opened Dec 12, 2025 by qgallouedec
5 tasks
loss calculation for evaluation without training
#4673 opened Dec 11, 2025 by SonuDixit
5 tasks
Update import structure
#4665 opened Dec 11, 2025 by qgallouedec Draft
Add GRPO QLoRA free notebook
#4660 opened Dec 10, 2025 by sergiopaniego Draft
5 tasks
[WIP] GRPO-inspired Online DPO refactor
#4659 opened Dec 10, 2025 by d-tiapkin Draft
2 of 7 tasks
feature: Add RTPO Trainer
#4652 opened Dec 9, 2025 by SolarWindRider
6 tasks done
Set version to packaged one in notebooks
#4648 opened Dec 9, 2025 by sergiopaniego
5 tasks
Preserve truncated tokens in BFD packing
#4632 opened Dec 5, 2025 by qgallouedec
Update docs landing with latest details
#4624 opened Dec 4, 2025 by sergiopaniego
6 tasks
Add PSPO trust region method as alternative to clipping in GRPOTrainer
#4548 opened Nov 19, 2025 by MCDwyer
2 of 5 tasks
Add compute_metrics parameter for GRPOTrainer
#4534 opened Nov 17, 2025 by colinzhaoxp