-
Updated
Nov 17, 2021 - Python
rocm
Here are 66 public repositories matching this topic...
Implement GPU version of numpy.* functions in cupy.* namespace.
This is a tracker issue that lists the remaining numpy.* APIs (see also: comparison table). I've categorized them based on difficulty so that new contributors can pick the right task. Your contribution is highly welcomed and appreciated!
List of A
-
Updated
Sep 11, 2018 - C++
Please go to https://github.com/deepmodeling/deepmd-kit/discussions to ask questions. Before asking questions, you can
- search the previous discussions
- check the document, especially training parameters
Please provide necessary information including the version of software and installation way,
-
Updated
Jul 26, 2021 - C++
Bug summary
There is evidence that sub_group::get_group_id() does not return the same value as threadIdx.x / warpSize (assuming 1D kernel), as expected on CUDA. We should check the implementation of this function. Our implementation of this function performs bit manipulation magic, presumably the optimization went to far...
To Reproduce
Compare sub_group{}.get_group_id() or `sub
-
Updated
Oct 21, 2021 - C
-
Updated
Feb 5, 2019 - C++
Since arrays may not actually be modified by a given operation, or might only be partially modified (or the user has some other way to ensure correctness).
-
Updated
Nov 17, 2021 - C++
-
Updated
Aug 4, 2021 - Go
-
Updated
Feb 5, 2019 - C++
-
Updated
Nov 4, 2021 - C++
-
Updated
Jun 16, 2020 - C++
-
Updated
Nov 17, 2021 - C++
-
Updated
Jun 26, 2019 - C++
-
Updated
Nov 5, 2021 - C++
-
Updated
Jul 31, 2020 - Shell
-
Updated
Sep 14, 2021
-
Updated
Nov 15, 2021 - C++
Improve this page
Add a description, image, and links to the rocm topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the rocm topic, visit your repo's landing page and select "manage topics."
I see comments suggesting adding this to understand how loops are being handled by numba, and in the their own FAQ (https://numba.pydata.org/numba-doc/latest/user/faq.html)
You would then create your njit function and run it, and I believe the idea is that it prints debug information about whether