Skip to content

Pull requests: huggingface/transformers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or ⇄1�7 + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Fix XGLM loss computation (PyTorch and TensorFlow)
#35878 opened Jan 24, 2025 by damianoamatruda Loading 1�7
Fix model kwargs
#35875 opened Jan 24, 2025 by muellerzr Draft
5 tasks
Make cache traceable
#35873 opened Jan 24, 2025 by IlyasMoutawwakil Loading 1�7
5 tasks
fix gemma that needed kwargs
#35871 opened Jan 24, 2025 by ArthurZucker Draft
Add default TP plan for all models with backend support
#35870 opened Jan 24, 2025 by Cyrilvallez Loading 1�7
[docs] fix bugs in the bitsandbytes documentation
#35868 opened Jan 24, 2025 by faaany Loading 1�7
[docs] no hard coding cuda as bnb has multi-backend support
#35867 opened Jan 24, 2025 by faaany Loading 1�7
Add Tensor Parallel support for Gemma
#35864 opened Jan 23, 2025 by kwen2501 Loading 1�7
4 of 5 tasks
[doctest] Fixes
#35863 opened Jan 23, 2025 by stevhliu Loading 1�7
Add padding-free to bamba
#35861 opened Jan 23, 2025 by garrett361 Loading 1�7
5 tasks
Fix TP initialization
#35860 opened Jan 23, 2025 by Cyrilvallez Loading 1�7
Whisper: fix static cache CI
#35852 opened Jan 23, 2025 by zucchini-nlp Loading 1�7
Github action for auto-assigning reviewers
#35846 opened Jan 22, 2025 by Rocketknight1 Loading 1�7
Update-tp
#35844 opened Jan 22, 2025 by ArthurZucker Loading 1�7
Update modeling_attn_mask_utils.py
#35841 opened Jan 22, 2025 by Cwndmiao Loading 1�7
5 tasks
Fix PretrainedTokenizerFast check
#35835 opened Jan 22, 2025 by CL-ModelCloud Loading 1�7
fix(FA): QKV not being casted to target_dtype for FA with dpo lora
#35834 opened Jan 22, 2025 by NanoCode012 Loading 1�7
1 of 5 tasks
ProTip! Type g i on any issue or pull request to go back to the issue listing page.