gpgpu
Here are 386 public repositories matching this topic...
-
Updated
Oct 16, 2021 - C++
Current implementation of join can be improved by performing the operation in a single call to the backend kernel instead of multiple calls.
This is a fairly easy kernel and may be a good issue for someone getting to know CUDA/ArrayFire internals. Ping me if you want additional info.
-
Updated
Oct 3, 2021 - C++
In order to test manually altered IR, it would be nice to have a --skip-compilation flag for futhark test, just like we do for futhark bench.
-
Updated
Apr 1, 2021 - Rust
-
Updated
Jul 30, 2018
-
Updated
Oct 7, 2021 - C#
-
Updated
Oct 5, 2021 - C++
-
Updated
Oct 15, 2021 - Objective-C
-
Updated
Oct 8, 2021 - Clojure
-
Updated
Oct 14, 2021 - Nim
-
Updated
Oct 4, 2021 - C++
-
Updated
Jul 26, 2021 - C++
Open issue to openly discuss potential ideas or improvements, whether on documentation, interfaces, examples, bug fixes, etc.
Add Javadoc to document the examples in TornadoVM.
This affects the packages under the examples module:
The documentation is at the class-level and it will contain a description of how the TornadoVM API is used for each example. Additionally, it contains how to run the example
-
Updated
Sep 15, 2021 - C
-
Updated
Jun 22, 2021 - Rust
-
Updated
Aug 17, 2016 - JavaScript
Bug summary
There is evidence that sub_group::get_group_id() does not return the same value as threadIdx.x / warpSize (assuming 1D kernel), as expected on CUDA. We should check the implementation of this function. Our implementation of this function performs bit manipulation magic, presumably the optimization went to far...
To Reproduce
Compare sub_group{}.get_group_id() or `sub
-
Updated
Oct 4, 2021 - C++
Improve this page
Add a description, image, and links to the gpgpu topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the gpgpu topic, visit your repo's landing page and select "manage topics."
Our users are often confused by the output from programs such as zip2john sometimes being very large (multi-gigabyte). Maybe we should identify and enhance these programs to output a message to stderr to explain to users that it's normal for the output to be very large - maybe always or maybe only when the output size is above a threshold (e.g., 1 million bytes?)