@rjzamora
Member

Introduces a simple `ToBackend` expression (and a `ToPandasBackend` implementation).

The current approach of using `map_partitions` in `PandasBackendEntrypoint` effectively blocks query-planning optimizations when data is moved between GPU and CPU. The obvious solution is to represent the conversion with a simple `Expr` class instead.
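
To illustrate the motivation, here is a minimal, self-contained sketch of why an opaque partition-wise function blocks predicate pushdown while a dedicated expression node does not. It is a toy model, not the dask-expr implementation: the classes `ReadTable`, `Filter`, `OpaqueMap`, and the function `push_down_filters` are hypothetical names invented for this example, and the `ToBackend` class here only borrows the name from this PR.

```python
from dataclasses import dataclass


# Toy expression nodes (hypothetical; not the dask-expr classes).
@dataclass(frozen=True)
class ReadTable:
    name: str


@dataclass(frozen=True)
class Filter:
    child: object
    predicate: str


@dataclass(frozen=True)
class OpaqueMap:
    """Stands in for a map_partitions call: the optimizer cannot see inside."""
    child: object
    func_name: str


@dataclass(frozen=True)
class ToBackend:
    """Explicit backend-conversion node that the optimizer can reason about."""
    child: object
    backend: str


def push_down_filters(expr):
    """Move Filter nodes below ToBackend nodes; never cross an OpaqueMap."""
    if isinstance(expr, Filter):
        child = push_down_filters(expr.child)
        if isinstance(child, ToBackend):
            # Assumption of this sketch: a row filter commutes with a
            # backend conversion, so it can run before the data is moved.
            pushed = push_down_filters(Filter(child.child, expr.predicate))
            return ToBackend(pushed, child.backend)
        # Any other child (including OpaqueMap) is treated as a barrier.
        return Filter(child, expr.predicate)
    if isinstance(expr, OpaqueMap):
        return OpaqueMap(push_down_filters(expr.child), expr.func_name)
    if isinstance(expr, ToBackend):
        return ToBackend(push_down_filters(expr.child), expr.backend)
    return expr


# Conversion hidden inside an opaque function: the filter stays on top.
opaque_plan = Filter(OpaqueMap(ReadTable("t"), "to_pandas"), "x > 0")
optimized_opaque = push_down_filters(opaque_plan)

# Conversion expressed as a node: the filter moves next to the read.
explicit_plan = Filter(ToBackend(ReadTable("t"), "pandas"), "x > 0")
optimized_explicit = push_down_filters(explicit_plan)
```

In this toy model the first plan comes back unchanged, while the second is rewritten so the filter sits directly on the read, which is the position from which a real optimizer could push it into the I/O layer.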

@rjzamora
Member Author

@phofl - This PR is (hopefully) much less controversial than the resource barrier idea in #1116 :)

@phofl merged commit 37a5116 into dask:main on Aug 13, 2024
@phofl
Collaborator

phofl commented Aug 13, 2024

thx

@rjzamora deleted the to-backend-expr branch on August 13, 2024, 15:19
rapids-bot bot pushed a commit to rapidsai/cudf that referenced this pull request Aug 19, 2024
Adds a `ToCudfBackend` expression for "pandas" to "cudf" conversion, preventing `to_backend("cudf")` operations from blocking useful optimizations like predicate pushdown.

This is the dask-cudf component of dask/dask-expr#1115

Authors:
  - Richard (Rick) Zamora (https://github.com/rjzamora)

Approvers:
  - Mads R. B. Kristensen (https://github.com/madsbk)

URL: #16573
Labels

enhancement New feature or request
