-
Updated
Jul 8, 2022 - Python
#
rocm
Here are 67 public repositories matching this topic...
Open deep learning compiler stack for cpu, gpu and specialized accelerators
javascript
machine-learning
performance
deep-learning
metal
compiler
gpu
vulkan
opencl
tensor
spirv
rocm
tvm
-
Updated
Sep 11, 2018 - C++
A deep learning package for many-body potential energy representation and molecular dynamics
-
Updated
Jul 7, 2022 - C++
stdgpu: Efficient STL-like Data Structures on the GPU
cpp
gpu
modern-cpp
cpp14
openmp
cuda
stl
data-structures
gpgpu
gpu-acceleration
cpp17
stl-containers
hip
gpu-computing
rocm
cpp20
stl-like
-
Updated
Jun 22, 2022 - C++
tomdeakin
commented
Jun 8, 2020
Just an FYI whilst I was trawling through the ROCm GitHub page:
https://rocmdocs.amd.com/en/latest/Programming_Guides/Programming-Guides.html#
Agenium Scale vectorization library for CPUs and GPUs
hpc
neon
cuda
avx
simd
avx2
sse2
simd-programming
aarch64
avx512
simd-instructions
simd-library
sse42
rocm
cpp20
sve
neon128
cpp20-library
vectorization-library
-
Updated
Oct 21, 2021 - C
jpsamaroo
commented
Apr 6, 2021
Since arrays may not actually be modified by a given operation, or might only be partially modified (or the user has some other way to ensure correctness).
AMD OpenVX Core -- a sub-module of amdovx-modules:
linux
cmake
cpu
opencl
range
vcxproj
amdgpu
rocm
radeon-open-compute
openvx
radeon-instinct-mi-series
radeon-vega-series
amd-openvx
khronos-openvx
vx-loomsl
-
Updated
Feb 5, 2019 - C++
MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.
machine-learning
computer-vision
neural-network
opencl
inference
amd-opencl
virtual-reality
rocm
inference-engine
openvx
ryzen
onnx
windows-machine-learning
amd-openvx
khronos-openvx
openvx-neural-network
amd-opencv
nnef
winml
openvx-extensions
-
Updated
Jul 8, 2022 - C++
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
-
Updated
Jul 7, 2022 - Go
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
linear-algebra
mpi
cuda
scalapack
matrix-multiplication
gpu-acceleration
rocm
matmul
communication-optimal
pdgemm
-
Updated
Jul 7, 2022 - C++
AMD OpenVX modules: such as, neural network inference, 360 video stitching, etc.
video-stitching
rocm
radeon-open-compute
openvx
onnx
neural-network-inference
radeon-instinct-mi-series
radeon-vega-series
-
Updated
Feb 5, 2019 - C++
GPUFORT: S2S translation tool for CUDA Fortran and Fortran+X in the spirit of hipify
-
Updated
May 24, 2022 - Fortran
The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA applications. This information can be used by developers to discover bottlenecks in the application and to find ways to optimize the application's performance.
-
Updated
Jun 16, 2020 - C++
Domain specific library for electronic structure calculations
cmake
gpu
mpi
cuda
density-functional-theory
hdf5
sirius
hdfs
spack
fftw
libxc
gsl
rocm
electronic-structure-calculations
pseudopotential
planewave
full-potential
lapw
spglib
piz-daint
-
Updated
Jul 6, 2022 - C++
AMD OpenCL userspace drivers for Fedora.
-
Updated
Jun 24, 2022 - Shell
Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support
-
Updated
Feb 17, 2022 - C++
Install guide of ROCm and Tensorflow on Ubuntu for the RX580
-
Updated
Nov 22, 2021
ROCm Machine Learning and HPC Stack installer
-
Updated
Jul 31, 2020 - Shell
Improve this page
Add a description, image, and links to the rocm topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the rocm topic, visit your repo's landing page and select "manage topics."
Description
https://numpy.org/doc/stable/reference/generated/numpy.corrcoef.html
https://docs.cupy.dev/en/stable/reference/generated/cupy.corrcoef.html
Seems args are different
Additional Information
dtypeargument added in NumPy version 1.20.