Pull requests: microsoft/DeepSpeed
Reset KV-cache at the beginning of text-generation
#2669 opened Jan 4, 2023 by RezaYazdaniAminabadi
Fix INT8-quantization for BLOOM, OPT, and Neo-X
#2662 opened Jan 2, 2023 by RezaYazdaniAminabadi
DeepSpeedZeroOptimizer dist.reduce optimization when self.round_robin_gradients is false
#2581 opened Dec 7, 2022 by Liangliang-Ma
Fix bugs in exp_count and add a boolean flag on whether to use both masks in calculating l_aux
#2559 opened Dec 1, 2022 by yyyyyt123