Pull requests: huggingface/transformers
Fix XGLM loss computation (PyTorch and TensorFlow)
#35878
opened Jan 24, 2025 by
damianoamatruda
Fix lost loss values when using user-defined compute_loss_func in some cases
#35872
opened Jan 24, 2025 by
dolphin-Dang
Add default TP plan for all models with backend support
#35870
opened Jan 24, 2025 by
Cyrilvallez
[docs] no hard coding cuda as bnb has multi-backend support
#35867
opened Jan 24, 2025 by
faaany
Fix device mismatch error in Whisper model during feature extraction
#35866
opened Jan 24, 2025 by
thedebugger
Add Tensor Parallel support for Gemma
#35864
opened Jan 23, 2025 by
kwen2501
4 of 5 tasks
Fix PaliGemma Pad Token Masking During Training #35855
#35859
opened Jan 23, 2025 by
sambhavnoobcoder
Add utility for Reload Transformers imports cache for development workflow #35508
#35858
opened Jan 23, 2025 by
sambhavnoobcoder
🚨🚨🚨 image-classification pipeline single-label and multi-label prob type squashing fns (sigmoid vs softmax) are backwards
bug
Core: Pipeline
Vision
#35848
opened Jan 22, 2025 by
rwightman
Fix Jitter Noise Passing to Experts in Switch Transformers #33969
#35847
opened Jan 22, 2025 by
sambhavnoobcoder
Nail in edge case of torch dtype being overridden permanently in the case of an error
#35845
opened Jan 22, 2025 by
muellerzr
1 of 5 tasks
Optimize Qwen2VL vision model by precomputing cos/sin embeds before ViT blocks
Multimodal
optimization
#35837
opened Jan 22, 2025 by
li-plus
1 of 5 tasks
fix(FA): QKV not being casted to target_dtype for FA with dpo lora
#35834
opened Jan 22, 2025 by
NanoCode012
1 of 5 tasks