#

cxx

Here are 367 public repositories matching this topic...

thrust
nv-dlasalle commented Mar 19, 2021

Problem

CUB can be wrapped in a user-chosen namespace via CUB_NS_PREFIX and CUB_NS_POSTFIX, so that multiple shared libraries can each use their own copy of it (and thus different versions can safely coexist). Without this, static variables used for caching could cause problems (e.g., https://github.com/NVIDIA/cub/blob/main/cub/util_device.cuh#L212).
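As a rough illustration of the mechanism described above, here is a minimal sketch of prefix/postfix namespace wrapping. Everything in it is hypothetical: `DEMO_LIB_BODY` stands in for a header's contents, `DEMO_NS_PREFIX` / `DEMO_NS_POSTFIX` play the role of CUB_NS_PREFIX / CUB_NS_POSTFIX, and `consumer_a` / `consumer_b` stand in for two libraries that each embed their own copy. It runs in a single translation unit, so it only shows that the wrapped copies get distinct static caches; it does not reproduce the shared-library scenario itself.

```cpp
#include <cassert>

// Hypothetical stand-in for a header-only library that caches a value in a
// function-local static. The macro plays the role of the header's contents.
#define DEMO_LIB_BODY                                         \
  namespace demolib {                                         \
  inline int& cached_device_count() {                         \
    static int cache = -1; /* -1 means "not queried yet" */   \
    return cache;                                             \
  }                                                           \
  } /* namespace demolib */

// First consumer: wraps its copy of the library in namespace consumer_a.
#define DEMO_NS_PREFIX namespace consumer_a {
#define DEMO_NS_POSTFIX }
DEMO_NS_PREFIX
DEMO_LIB_BODY
DEMO_NS_POSTFIX
#undef DEMO_NS_PREFIX
#undef DEMO_NS_POSTFIX

// Second consumer: same library body, different wrapping namespace.
#define DEMO_NS_PREFIX namespace consumer_b {
#define DEMO_NS_POSTFIX }
DEMO_NS_PREFIX
DEMO_LIB_BODY
DEMO_NS_POSTFIX
#undef DEMO_NS_PREFIX
#undef DEMO_NS_POSTFIX

// Writing to one copy's cache must not be visible through the other copy.
inline void check_isolated_caches() {
  int& a = consumer_a::demolib::cached_device_count();
  int& b = consumer_b::demolib::cached_device_count();
  a = 4;               // consumer_a fills its cache
  assert(b == -1);     // consumer_b's cache is untouched
  assert(&a != &b);    // the two statics are distinct objects
}
```

Because each copy lives under a different enclosing namespace, the two `cached_device_count` functions are distinct entities with distinct statics, which is the property the wrapping macros are meant to provide.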

Thrust however depends on cub and

cub
allisonvacanti commented Apr 23, 2021

DeviceSegmentedReduce reduces each segment using the same kernel as DeviceReduce, and thus provides the same run-to-run consistency guarantee for non-associative types. The documentation for DeviceSegmentedReduce should be updated to reflect this.

E.g. this bit: https://github.com/NVIDIA/cub/blob/a8910accebe74ce043a13026f8e71d678cddd6c1/cub/device/device_reduce.cuh#L91-L95
needs to be ad
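To make the run-to-run consistency point in the DeviceSegmentedReduce note concrete, here is a small sequential CPU sketch. It is not CUB's kernel and does not reflect CUB's actual reduction order; the function names `segmented_sum_left_to_right` and `segmented_sum_right_to_left` are hypothetical. It only shows that floating-point addition is order-sensitive, and that repeating a reduction with the same fixed order yields identical results.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Sums each segment [offsets[s], offsets[s + 1]) strictly left to right.
inline std::vector<double> segmented_sum_left_to_right(
    const std::vector<double>& values,
    const std::vector<std::size_t>& offsets) {
  std::vector<double> out;
  for (std::size_t s = 0; s + 1 < offsets.size(); ++s) {
    double acc = 0.0;
    for (std::size_t i = offsets[s]; i < offsets[s + 1]; ++i) acc += values[i];
    out.push_back(acc);
  }
  return out;
}

// Same segments, but each one is summed right to left.
inline std::vector<double> segmented_sum_right_to_left(
    const std::vector<double>& values,
    const std::vector<std::size_t>& offsets) {
  std::vector<double> out;
  for (std::size_t s = 0; s + 1 < offsets.size(); ++s) {
    double acc = 0.0;
    for (std::size_t i = offsets[s + 1]; i > offsets[s]; --i) acc += values[i - 1];
    out.push_back(acc);
  }
  return out;
}

// Two runs with the same fixed order must agree exactly.
inline void check_run_to_run_consistency(
    const std::vector<double>& values,
    const std::vector<std::size_t>& offsets) {
  assert(segmented_sum_left_to_right(values, offsets) ==
         segmented_sum_left_to_right(values, offsets));
}
```

With values `{1e100, -1e100, 1.0, 2.0, 3.0}` and offsets `{0, 3, 5}`, the first segment sums to 1.0 left to right but 0.0 right to left, because the 1.0 is absorbed when it is added to -1e100 first. A fixed order avoids this kind of variation between runs.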
