This repository was archived by the owner on Mar 21, 2024. It is now read-only.
--- a/README.md
+++ b/README.md
@@ -1,7 +1,7 @@
 <hr>
 <h3>About CUB</h3>
 
-Current release: v1.5.2 (03/21/2016)
+Current release: v1.5.3 (10/11/2016)
 
 We recommend the [CUB Project Website](http://nvlabs.github.com/cub) and the [cub-users discussion forum](http://groups.google.com/group/cub-users) for further information and examples.
* \brief Computes an exclusive block-wide prefix scan using the specified binary \p scan_op functor. Each thread contributes an array of consecutive input elements. With no initial value, the output computed for <em>thread</em><sub>0</sub> is undefined.
+ *
+ * \par
+ * - Supports non-commutative scan operators.
+ * - \blocked
+ * - \granularity
+ * - \smemreuse
+ *
+ * \tparam ITEMS_PER_THREAD <b>[inferred]</b> The number of consecutive items partitioned onto each thread.
+ * \tparam ScanOp <b>[inferred]</b> Binary scan functor type having member <tt>T operator()(const T &a, const T &b)</tt>
+ */
+ template <
+     int ITEMS_PER_THREAD,
+     typename ScanOp>
+ __device__ __forceinline__ void ExclusiveScan(
+     T (&input)[ITEMS_PER_THREAD], ///< [in] Calling thread's input items
+     T (&output)[ITEMS_PER_THREAD], ///< [out] Calling thread's output items (may be aliased to \p input)
* \brief Computes an exclusive block-wide prefix scan using the specified binary \p scan_op functor. Each thread contributes an array of consecutive input elements. Also provides every thread with the block-wide \p block_aggregate of all inputs. With no initial value, the output computed for <em>thread</em><sub>0</sub> is undefined.
+ *
+ * \par
+ * - Supports non-commutative scan operators.
+ * - \blocked
+ * - \granularity
+ * - \smemreuse
+ *
+ * \tparam ITEMS_PER_THREAD <b>[inferred]</b> The number of consecutive items partitioned onto each thread.
+ * \tparam ScanOp <b>[inferred]</b> Binary scan functor type having member <tt>T operator()(const T &a, const T &b)</tt>
+ */
+ template <
+     int ITEMS_PER_THREAD,
+     typename ScanOp>
+ __device__ __forceinline__ void ExclusiveScan(
+     T (&input)[ITEMS_PER_THREAD], ///< [in] Calling thread's input items
+     T (&output)[ITEMS_PER_THREAD], ///< [out] Calling thread's output items (may be aliased to \p input)
+     ScanOp scan_op, ///< [in] Binary scan functor
+     T &block_aggregate) ///< [out] Block-wide aggregate reduction of input items