
CUDA

CUDA® is a parallel computing platform and programming model developed by NVIDIA for general-purpose computing on graphics processing units (GPUs). With CUDA, developers can dramatically speed up computing applications by harnessing the power of GPUs.

Here are 3,395 public repositories matching this topic...

numba
rhjmoore commented Sep 1, 2021

I see comments, and Numba's own FAQ (https://numba.pydata.org/numba-doc/latest/user/faq.html), suggesting adding the following to understand how Numba handles loops:

from llvmlite import binding as llvm
llvm.set_option('','--debug-only=loop-vectorize')

You would then create your njit function and run it, and I believe the idea is that it prints debug information about whether

ayulockin commented Dec 1, 2021

I am working on creating a WandbCallback for Weights and Biases. I am glad that CatBoost has a callback system in place, but it would be great if we could extend the interface.

The current callback supports only after_iteration, which takes info. Taking inspiration from the XGBoost callback system, it would be great to have before_iteration that takes info, before_training, and `after
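The requested interface could look roughly like the sketch below. This is not CatBoost's or XGBoost's actual API: `RecordingCallback`, `run_training`, and the dict-shaped `info` are hypothetical stand-ins, and only the three hook names that survive in the text above are shown (the fourth is cut off in the source). The convention that a hook returns a false value to stop training is also an assumption of this sketch.

```python
class RecordingCallback:
    """Hypothetical callback that records which hooks were called."""

    def __init__(self):
        self.events = []

    def before_training(self, info):
        # Proposed hook: called once before the first iteration.
        self.events.append("before_training")

    def before_iteration(self, info):
        # Proposed hook: returning False would skip the rest of training.
        self.events.append(f"before_iteration:{info['iteration']}")
        return True

    def after_iteration(self, info):
        # Hook named in the issue as already supported.
        self.events.append(f"after_iteration:{info['iteration']}")
        # Assumption of this sketch: stop once iteration 1 has finished.
        return info["iteration"] < 1


def run_training(callback, n_iterations):
    """Toy driver loop standing in for a real trainer (hypothetical)."""
    callback.before_training({"iteration": None})
    for i in range(n_iterations):
        info = {"iteration": i}  # a real trainer would also pass metrics
        if not callback.before_iteration(info):
            break
        # A real trainer would fit one boosting round here.
        if not callback.after_iteration(info):
            break
    return callback.events
```

Calling `run_training(RecordingCallback(), 5)` runs two iterations and then stops, because `after_iteration` returns False on iteration 1.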

rogerallen commented Feb 20, 2022

Description

Calling vectorize with a non-None value for the signature parameter outputs this error message about the excluded parameter.

NotImplementedError: cupy.vectorize does not support `excluded` option currently.

Inspecting the code, it is obvious there is a copy-paste error: the second error message should say signature instead of excluded.

https://github.com/cupy/c
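A minimal sketch of the fix the report describes, assuming the two options are checked one after the other. `check_vectorize_options` is a hypothetical helper written for illustration, not CuPy's actual code; the `excluded` message is quoted from the report, and the `signature` message is the corrected wording the report asks for.

```python
def check_vectorize_options(excluded=None, signature=None):
    """Hypothetical stand-in for the option checks described in the report."""
    if excluded is not None:
        raise NotImplementedError(
            "cupy.vectorize does not support `excluded` option currently."
        )
    if signature is not None:
        # Corrected: this message now names `signature`, not `excluded`.
        raise NotImplementedError(
            "cupy.vectorize does not support `signature` option currently."
        )
```

With this change, passing a non-None `signature` produces a message that names the option the caller actually used.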

bdice commented Feb 3, 2022

Is your feature request related to a problem? Please describe.
While reviewing PR #9817 to introduce DataFrame.diff, I noticed that it is restricted to acting on numeric types.

A time-series diff is probably a very common user need: given a series of timestamps, a user often wants the durations between observations.
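To make the request concrete, here is a small standard-library sketch of what a diff over timestamps computes. It does not use cuDF or pandas; the `diff` helper and the sample timestamps are made up for illustration and only mimic the shape of a `periods`-style diff, with `None` marking positions that have no earlier value.

```python
from datetime import datetime, timedelta


def diff(values, periods=1):
    """Difference between each element and the one `periods` steps earlier."""
    if periods < 1:
        raise ValueError("this sketch only handles periods >= 1")
    # The first `periods` positions have nothing to subtract from.
    head = [None] * min(periods, len(values))
    tail = [later - earlier for earlier, later in zip(values, values[periods:])]
    return head + tail


# Hypothetical observation times.
timestamps = [
    datetime(2022, 2, 1, 12, 0),
    datetime(2022, 2, 1, 12, 5),
    datetime(2022, 2, 1, 12, 20),
]
durations = diff(timestamps)
```

The same helper works for numbers and for timestamps because both support subtraction; for timestamps the results are `timedelta` durations rather than numbers.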

Pandas supports diffs on non-numeric types like timestamps:

thrust
oneflow
tattain404 commented Feb 22, 2022

Summary

I ran the code from the tutorial https://docs.oneflow.org/master/parallelism/05_ddp.html
for "Data-parallel training by setting SBP", but it fails with

'MobileNetV2' object has no attribute 'to_global'

I also tried defining a NeuralNetwork class using class NeuralNetwork(nn.Module):
and model = NeuralNetwork().to(DEVICE)
and then tried to use model.to_global to allocate the model to GPU clusters, but it

wphicks commented Feb 8, 2021

Report needed documentation
While the estimator guide offers a great breakdown of how to use many of the tools in api_context_managers.py, it would be helpful to have information right in the docstring during development to more easily understand what is actually going on in each of the provided functions/classes/methods. This is particularly important for

Created by Nvidia

Released June 23, 2007

Website
developer.nvidia.com/cuda-zone
Wikipedia

Related Topics

nvcc