-
Updated
Nov 13, 2020 - Python
rocm
Here are 50 public repositories matching this topic...
-
Updated
Sep 11, 2018 - C++
-
Updated
Nov 8, 2020 - C++
Just an FYI whilst I was trawling through the ROCm GitHub page:
https://rocmdocs.amd.com/en/latest/Programming_Guides/Programming-Guides.html#
-
Updated
Feb 5, 2019 - C++
-
Updated
Nov 10, 2020 - Python
-
Updated
Feb 5, 2019 - C++
-
Updated
Sep 25, 2020 - C++
-
Updated
Nov 13, 2020 - C++
-
Updated
Jun 16, 2020 - C++
-
Updated
Apr 30, 2020 - Go
-
Updated
Nov 13, 2020 - C++
-
Updated
Jun 26, 2019 - C++
-
Updated
Aug 18, 2020 - C++
-
Updated
Jul 31, 2020 - Shell
-
Updated
May 22, 2020 - C++
-
Updated
Oct 29, 2020 - C++
-
Updated
Oct 14, 2017 - Chapel
Improve this page
Add a description, image, and links to the rocm topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the rocm topic, visit your repo's landing page and select "manage topics."
PR #6447 adds a public API to get the maximum number of registers per thread (
numba.cuda.Dispatcher.get_regs_per_thread()). There are other attributes that might be nice to provide - shared memory per block, local memory per thread, const memory usage, maximum block size.These are all available in the
FuncAttrnamed tuple: https://github.com/numba/numba/blob/master/numba/cuda/cudadrv/drive