Issues: microsoft/DeepSpeed
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[BUG] return getattr(args, f"{model_type[step_num]}_model")
bug
Something isn't working
compression
#3231
opened Apr 14, 2023 by
koalawangyang
[BUG] Fail to run the example in/DeepSpeedExamples
bug
Something isn't working
compression
#3229
opened Apr 14, 2023 by
wang700
[BUG] batch_size check failed with zero 2 (deepspeed v0.9.0)
bug
Something isn't working
training
#3228
opened Apr 14, 2023 by
chenmingjiong
[BUG] Installed CUDA version 12.1 does not match the version torch was compiled with 11.8
bug
Something isn't working
training
#3223
opened Apr 14, 2023 by
ggyggy666
[BUG]the following arguments are required: user_script, user_args
bug
Something isn't working
compression
#3222
opened Apr 14, 2023 by
TinyQi
[REQUEST] Please spend more time on the usability of the project, especially the doc.
enhancement
New feature or request
#3220
opened Apr 14, 2023 by
tbwork
[BUG] multi-node inference initialization fails when trying not to use replace_with_kernel_inject
bug
Something isn't working
inference
#3217
opened Apr 13, 2023 by
brevity2021
[BUG]Out of memory when training, and is streaming mode supported ?
bug
Something isn't working
training
#3214
opened Apr 13, 2023 by
wqw547243068
[BUG] Unable to pre-compile async_io
bug
Something isn't working
compression
#3211
opened Apr 13, 2023 by
littleshao
[BUG] NCCL out of memory on Something isn't working
training
save_checkpoint()
bug
#3210
opened Apr 13, 2023 by
fteufel
[BUG]RuntimeError: Step 1 exited with non-zero status 1
bug
Something isn't working
compression
#3208
opened Apr 13, 2023 by
ucas010
[BUG]error: can't copy 'deepspeed/accelerator': doesn't exist or not a regular file
bug
Something isn't working
compression
#3207
opened Apr 13, 2023 by
ucas010
[BUG] "with deepspeed.zero.Init()" is not idempotent
bug
Something isn't working
training
#3202
opened Apr 12, 2023 by
eisene
[BUG] error: use of undeclared identifier '__double2half'; did you mean '__double2hiint'?"
bug
Something isn't working
inference
#3197
opened Apr 12, 2023 by
WeiMa01
whl does not get created following the instructions on Windows 11 [BUG]
bug
Something isn't working
inference
#3196
opened Apr 12, 2023 by
acube3
cpu memory out of use when infering on 30b model
bug
Something isn't working
inference
#3192
opened Apr 12, 2023 by
frankxyy
[BUG] ds inference succeed for 2 gpus, oom for 4 gpus
bug
Something isn't working
inference
#3182
opened Apr 11, 2023 by
frankxyy
[BUG] Inference failed serveral times
bug
Something isn't working
inference
#3181
opened Apr 11, 2023 by
Quang-elec44
[BUG] Intermittent RuntimeError: The specified pointer resides on host memory and is not registered with any CUDA device.
bug
Something isn't working
inference
#3178
opened Apr 10, 2023 by
fweckesser
[BUG] Deepspeed inference fp16 gives different results than HuggingFace with FlanT5-XL
bug
Something isn't working
inference
#3177
opened Apr 10, 2023 by
brevity2021
[Deepspeed stage-3 student+teacher crash]
bug
Something isn't working
training
#3175
opened Apr 10, 2023 by
andrasiani
AssertionError: AutoTP not supported for model. Please use kernel injection since container policy for model exists.
bug
Something isn't working
inference
#3174
opened Apr 10, 2023 by
suri-kunal
Installing Ops for using with Pyinstaller
bug
Something isn't working
inference
#3173
opened Apr 10, 2023 by
Eichhof
Previous Next
ProTip!
Updated in the last three days: updated:>2023-04-11.