 Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://rocfft.readthedocs.io/en/latest/).
 
-## [(Unreleased) rocFFT 1.0.10 for ROCm 4.1.0]
+## [rocFFT 1.0.10 for ROCm 4.1.0]
 
 ### Added
 - Explicitly specify MAX_THREADS_PER_BLOCK through \_\_launch\_bounds\_\_ for all
-  manual kernels.
+  kernels.
+- Switch to new syntax for specifying AMD GPU architecture names and features.
 
 ### Optimizations
 - Optimized C2C/R2C 3D 64, 81, 100, 128, 200, 256 cube sizes.
@@ -18,15 +19,6 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
 ### Changed
 - rocFFT now automatically allocates a work buffer if the plan
   requires one but none is provided.
-
-## [(Unreleased) rocFFT 1.0.9 for ROCm 4.0.0]
-
-### Added
-- Explicitly specify MAX_THREADS_PER_BLOCK through \_\_launch\_bounds\_\_ for all
-  generated kernels.
-- Switch to new syntax for specifying AMD GPU architecture names and features.
-
-### Changed
 - An explicit `rocfft_status_invalid_work_buffer` error is now
   returned when a work buffer of insufficient size is provided.
 - Updated online documentation.
@@ -90,4 +82,4 @@ Full documentation for rocFFT is available at [rocfft.readthedocs.io](https://ro
 - Fixed correctness of certain large 1D, and 2D power-of-3, 5 sizes.
 - Fixed incorrect buffer assignment for some even-length R2C transforms.
 - Fixed `<cstddef>` inclusion on C compilers.
-- Fixed incorrect results on non-unit strides with SBCC/SBRC kernels.
+- Fixed incorrect results on non-unit strides with SBCC/SBRC kernels.