Simplify `cuda::host_launch` API #6689

davebayer · 2025-11-19T14:51:15Z

Previously, we had a special overload for the case where the user passes a cuda::std::reference_wrapper as the callable without any arguments.

This PR removes that overload and handles the case inside the generic implementation. In addition, functions that return void and take no arguments are now launched with cuLaunchHostFunc, which doesn't require a memory allocation.
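To illustrate the idea, here is a minimal host-only sketch of a single generic entry point that handles the reference_wrapper case internally. It is not the CCCL implementation: `fake_launch_host_func`, `sketch_host_launch`, `call_by_pointer`, `call_and_delete`, and `counter` are hypothetical names, `std::reference_wrapper` stands in for cuda::std::reference_wrapper, and the stand-in "driver" call runs the callback immediately instead of enqueuing it on a stream. The sketch also assumes the referenced callable is non-const.

```cpp
#include <cassert>
#include <functional>
#include <memory>
#include <tuple>
#include <type_traits>
#include <utility>

// Hypothetical stand-in for a driver entry point that takes a
// `void (*)(void*)` callback plus an opaque pointer. It runs the
// callback immediately; a real stream would run it in stream order.
using host_fn_t = void (*)(void*);
inline void fake_launch_host_func(host_fn_t fn, void* user_data) { fn(user_data); }

template <class T> struct is_reference_wrapper : std::false_type {};
template <class T> struct is_reference_wrapper<std::reference_wrapper<T>> : std::true_type {};

// Trampoline for the no-allocation path: the opaque pointer is the
// address of the caller's own callable.
template <class Callable>
void call_by_pointer(void* user_data) {
  assert(user_data != nullptr);
  (*static_cast<Callable*>(user_data))();
}

// Trampoline for the general path: takes ownership of the heap copy,
// invokes the callable with its arguments, then frees the copy.
template <class Tuple>
void call_and_delete(void* user_data) {
  std::unique_ptr<Tuple> data(static_cast<Tuple*>(user_data));
  std::apply([](auto& fn, auto&... xs) { std::invoke(fn, xs...); }, *data);
}

// Single generic entry point (hypothetical). The reference_wrapper case
// is detected with `if constexpr` instead of a dedicated overload.
template <class Callable, class... Args>
void sketch_host_launch(Callable callable, Args... args) {
  if constexpr (sizeof...(Args) == 0 && is_reference_wrapper<Callable>::value) {
    using target_t = typename Callable::type;
    fake_launch_host_func(&call_by_pointer<target_t>,
                          static_cast<void*>(std::addressof(callable.get())));
  } else {
    using data_t = std::tuple<Callable, Args...>;
    auto* data   = new data_t(std::move(callable), std::move(args)...);
    fake_launch_host_func(&call_and_delete<data_t>, data);
  }
}

// Example callable used to observe reference vs. copy semantics.
struct counter {
  int hits = 0;
  void operator()() { ++hits; }
};
```

With this sketch, `sketch_host_launch(std::ref(c))` increments the caller's `counter` without allocating, while `sketch_host_launch(c)` operates on a heap copy and leaves the original untouched.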

pciolkosz · 2025-11-19T18:42:12Z

libcudacxx/include/cuda/__launch/host_launch.h

-//! @param __stream Stream to launch the host function on
-//! @param __callable A reference to a host function or callable object to call in stream order
-template <class _Callable>
-_CCCL_HOST_API void host_launch(stream_ref __stream, ::cuda::std::reference_wrapper<_Callable> __callable)


I would prefer to keep the separate overload; it's easier to document this mode. Otherwise, with one overload, you need to describe the set of conditions that avoid the allocation, whereas here they are expressed in code.

I don't agree; I actually think it makes everything much easier. We would have a single function. We can simply document the behaviour without talking about memory allocations, and point out that cuda::std::reference_wrapper can be used when the user wants to pass a reference to a callable or an argument.

Then, in the Performance Considerations section that we usually have, we would describe that if no parameters are passed to the function and the callable is either a free function or a cuda::std::reference_wrapper, we use cuLaunchHostFunc without memory allocations, and cuStreamAddCallback otherwise.

I think this makes everything much cleaner.
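The dispatch rule described in this comment can be sketched as a compile-time classifier. This is a hedged illustration, not the code under review: `launch_path`, `select_launch_path`, `free_function`, and `functor` are hypothetical names, `std::reference_wrapper` stands in for cuda::std::reference_wrapper, and the sketch only checks the callable's kind and the argument count (it does not check the return type).

```cpp
#include <functional>
#include <type_traits>

// Which driver call the launch would map to, per the rule stated above.
enum class launch_path {
  host_func_no_alloc,         // cuLaunchHostFunc, no memory allocation
  stream_callback_with_alloc  // cuStreamAddCallback, state copied to the heap
};

template <class T> struct is_reference_wrapper : std::false_type {};
template <class T> struct is_reference_wrapper<std::reference_wrapper<T>> : std::true_type {};

template <class Callable, class... Args>
constexpr launch_path select_launch_path() {
  // Decay first so that a free function type becomes a function pointer.
  using callable_t = std::decay_t<Callable>;
  constexpr bool has_args = sizeof...(Args) != 0;
  // A function pointer is a pointer whose pointee is a function type.
  constexpr bool is_function_pointer =
    std::is_pointer_v<callable_t> && std::is_function_v<std::remove_pointer_t<callable_t>>;
  if constexpr (!has_args && (is_function_pointer || is_reference_wrapper<callable_t>::value)) {
    return launch_path::host_func_no_alloc;
  } else {
    return launch_path::stream_callback_with_alloc;
  }
}

// Hypothetical callables used only to exercise the classifier.
inline void free_function() {}
struct functor {
  void operator()() const {}
};

static_assert(select_launch_path<decltype(&free_function)>() == launch_path::host_func_no_alloc);
static_assert(select_launch_path<std::reference_wrapper<functor>>() == launch_path::host_func_no_alloc);
static_assert(select_launch_path<functor>() == launch_path::stream_callback_with_alloc);
```

Under these assumptions, a by-value functor or any call with arguments falls back to the allocating path, which is the set of conditions a Performance Considerations section would need to spell out.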

People usually just glance at the documentation, so they will immediately notice two overloads; only a very small subset will read the Performance Considerations section.
I actually think the overload you are removing is more important than the other one and should be used more often. That's why I want it to be as visible as possible.

pciolkosz · 2025-11-19T18:43:09Z

libcudacxx/include/cuda/__launch/host_launch.h

-  // We use the callback here to have it execute even on stream error, because it needs to free the above allocation
-  ::cuda::__driver::__streamAddCallback(__stream.get(), __stream_callback_launcher<_CallbackData>, __callback_data_ptr);
-}
+  if constexpr (!__has_args && ::cuda::std::is_function_v<_Callable> && ::cuda::std::is_pointer_v<_Callable>)


I like this

github-actions · 2025-11-19T23:19:08Z

🥳 CI Workflow Results

🟩 Finished in 1h 34m: Pass: 100%/90 | Total: 12h 57m | Max: 53m 25s | Hits: 99%/213937

davebayer requested a review from a team as a code owner November 19, 2025 14:51

davebayer requested a review from bernhardmgruber November 19, 2025 14:51

github-project-automation bot added this to CCCL Nov 19, 2025

github-project-automation bot moved this to Todo in CCCL Nov 19, 2025

cccl-authenticator-app bot moved this from Todo to In Review in CCCL Nov 19, 2025

davebayer added the backport branch/3.2.x label Nov 19, 2025

davebayer self-assigned this Nov 19, 2025

Simplify cuda::host_launch API

8b64eb2

davebayer force-pushed the remove_host_launch_ref_wrapper branch from c313a52 to 8b64eb2 Compare November 19, 2025 16:06

davebayer requested a review from pciolkosz November 19, 2025 18:35

pciolkosz reviewed Nov 19, 2025

add missing include and add explicit cast to void*

867bedc
