Releases: KernelTuner/kernel_tuner
Version 0.1.1
[0.1.1] - 2017-02-10
Changed
- changed the output format to a list of dictionaries
Added
- option to set compiler options
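A list-of-dictionaries result is easy to post-process with plain Python. The sketch below is illustrative only: it assumes each dictionary holds the tunable parameter values plus a measured `time`, and the key names, the values, and the `best_config` helper are hypothetical rather than taken from the Kernel Tuner itself.

```python
# Hypothetical tuning results: one dictionary per benchmarked configuration.
# The key names ("block_size_x", "time") and all values are assumptions
# made for illustration, not actual Kernel Tuner output.
results = [
    {"block_size_x": 128, "time": 0.52},
    {"block_size_x": 256, "time": 0.41},
    {"block_size_x": 512, "time": 0.47},
]


def best_config(results, key="time"):
    """Return the dictionary with the smallest value stored under `key`."""
    return min(results, key=lambda r: r[key])


best = best_config(results)
print(best)
```

Because every entry is an ordinary dictionary, the same list can also be sorted, filtered, or written to JSON without any custom parsing.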
Version 0.1.0
The Kernel Tuner should now be ready for production use. Over the last few months we have used it in several projects, which revealed a number of issues that are fixed in this version. This release also marks the end of a period in which the internal structure of the Kernel Tuner changed several times; we expect the current code structure to remain stable for a while. With this version we also release the public roadmap for the project, which shows the changes and additional features we have planned for the near and more distant future. We also consider the software ready to be added to public software repositories, which we will do shortly.
first beta release
This is the first beta release of the Kernel Tuner. It is in beta testing to determine what functionality is missing and what needs to be fixed before the code can be considered production ready.
A brief description of the Kernel Tuner's functionality in this version:
- Basic kernel tuning functionality for CUDA, OpenCL, and C functions
- Many examples and rather extensive documentation
- Search space restriction, using the 'restrictions' option
- Kernel output verification, using the 'answer' option
- Example showing how to tune both host code (number of streams) and GPU code
- Run a single kernel with a specific parameter set and get the output
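To make the idea of search space restriction concrete, here is a minimal sketch in plain Python. It does not call the Kernel Tuner and is not its implementation: the `search_space` helper, the parameter names, and the use of callables as restrictions are all assumptions chosen for illustration. It enumerates every combination of tunable parameter values and keeps only those that satisfy all restrictions.

```python
from itertools import product


def search_space(tune_params, restrictions=()):
    """Enumerate all parameter combinations that satisfy every restriction.

    tune_params:  dict mapping a parameter name to a list of candidate values.
    restrictions: iterable of callables that take a configuration dict and
                  return True when the configuration is allowed.
    """
    names = list(tune_params)
    configs = []
    for values in product(*(tune_params[name] for name in names)):
        config = dict(zip(names, values))
        if all(restriction(config) for restriction in restrictions):
            configs.append(config)
    return configs


# Hypothetical tunable parameters for a 2D thread block.
tune_params = {
    "block_size_x": [16, 32, 64],
    "block_size_y": [1, 2, 4, 8],
}

# Hypothetical restriction: allow at most 128 threads per block.
restrictions = [lambda p: p["block_size_x"] * p["block_size_y"] <= 128]

configs = search_space(tune_params, restrictions)
print(len(configs), "configurations remain after applying the restrictions")
```

Filtering before benchmarking means that invalid or uninteresting configurations never have to be compiled or run, which is the main benefit of restricting the search space.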