|
366 | 366 | * - <b><em>Blocked arrangement</em></b>. The aggregate tile of items is partitioned |
367 | 367 | * evenly across threads in "blocked" fashion with thread<sub><em>i</em></sub> |
368 | 368 | * owning the <em>i</em><sup>th</sup> segment of consecutive elements. |
| 369 | + * Blocked arrangements are often desirable for algorithmic benefits (where |
| 370 | + * long sequences of items can be processed sequentially within each thread). |
369 | 371 | * </td> |
370 | 372 | * <td> |
371 | 373 | * \par |
|
377 | 379 | * \par |
378 | 380 | * - <b><em>Striped arrangement</em></b>. The aggregate tile of items is partitioned across |
379 | 381 | * threads in "striped" fashion, i.e., the \p ITEMS_PER_THREAD items owned by |
380 | | - * each thread have logical stride \p BLOCK_THREADS between them. |
| 382 | + * each thread have logical stride \p BLOCK_THREADS between them. Striped arrangements |
| 383 | + * are often desirable for data movement through global memory (where |
| 384 | + * [read/write coalescing](http://docs.nvidia.com/cuda/cuda-c-best-practices-guide/#coalesced-access-global-memory)</a> |
| 385 | + * is an important performance consideration). |
381 | 386 | * </td> |
382 | 387 | * <td> |
383 | 388 | * \par |
|
398 | 403 | * facilitates greater ILP for improved throughput and utilization. |
399 | 404 | * |
400 | 405 | * \par |
401 | | - * Furthermore, cub::BlockExchange provides operations for converting between blocked |
402 | | - * and striped arrangements. Blocked arrangements are often desirable for |
403 | | - * algorithmic benefits (where long sequences of items can be processed sequentially |
404 | | - * within each thread). Striped arrangements are often desirable for data movement |
405 | | - * through global memory (where |
406 | | - * [read/write coalescing](http://docs.nvidia.com/cuda/cuda-c-best-practices-guide/#coalesced-access-global-memory)</a> |
407 | | - * is an important performance consideration). |
| 406 | + * Finally, cub::BlockExchange provides operations for converting between blocked |
| 407 | + * and striped arrangements. |
408 | 408 | * |
409 | 409 | * \section sec7 (7) Contributors |
410 | 410 | * |
|
0 commit comments