[BUG] network crashes with stage=3 & dynamically-changed network
bug
#1843
opened Mar 18, 2022 by
amsword
[REQUEST] Why ep_size should be less than world_size?
enhancement
#1838
opened Mar 16, 2022 by
Satan012
[BUG] AttributeError: 'DeepSpeedEngine' object has no attribute 'quantizer'
bug
#1837
opened Mar 16, 2022 by
AQA6666
[REQUEST] ZeRO-Infinity: GPU Memory Usage Higher Than Expected
enhancement
#1831
opened Mar 14, 2022 by
kiehls90
[BUG] Same train time with DeepSpeed (despite increased batch size)
bug
#1825
opened Mar 11, 2022 by
pminhyung
Can deepspeed be used to infer fairseq trained gpt models?
enhancement
#1824
opened Mar 11, 2022 by
dingjingzhen
[BUG] deepspeed_aio_handle_t::_stop_threads(): Assertion `0 == _num_pending_ops' failed.
bug
#1813
opened Mar 6, 2022 by
JIN-096
[BUG] lr scheduler get_last_lr() does not work with fp16 enabled
bug
#1808
opened Mar 4, 2022 by
mbetser
[BUG] ZeRO/bf16 grad accumulation in bf16 needs higher precision accumulator
bug
#1800
opened Mar 1, 2022 by
stas00
[BUG] DeepSpeed Inference with GPT-J using batches with padding gives wrong outputs
bug
#1797
opened Feb 27, 2022 by
tomerip
[BUG] IndexError / Runtime Error with torch.nn.TransformerEncoder
bug
#1795
opened Feb 26, 2022 by
floatshadow