A place for my personal CUDA tinkering to reside. The layout of the repository adheres to The Pitchfork Layout, specifically using seperate header placement and merged test placement.
From the repository root:
mkdir build && cd build && cmake ..
For example, to build Tests for Release on Windows:
msbuild.exe Tests.vcxproj /property:Platform=x64 /property:Configuration=Release
To then run the Tests executable:
Release\Tests.exe
Work In Progress. Currently toying around with my exclusive scan implementation using the CUDA cooperative groups API.
Given a binary associative operator
The canonical single-precision ax + y kernel:
CUDA and host utility functions that are useful when working with the CUDA programming model.