Skip to content

Fix data_alloc_size query bugs#21955

Merged
rapids-bot[bot] merged 7 commits intorapidsai:mainfrom
rjzamora:fix-mem-size-bugs
Apr 1, 2026
Merged

Fix data_alloc_size query bugs#21955
rapids-bot[bot] merged 7 commits intorapidsai:mainfrom
rjzamora:fix-mem-size-bugs

Conversation

@rjzamora
Copy link
Copy Markdown
Member

Description

I think we want to avoid specifying MemoryType.DEVICE when we query the allocation size of a TableChunk. The data may be in host or device memory, so we want to leave the argument empty. This PR also addresses a TODO statement related to un-spilling data to access the row-count.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@rjzamora rjzamora self-assigned this Mar 30, 2026
@rjzamora rjzamora requested a review from a team as a code owner March 30, 2026 22:05
@rjzamora rjzamora added bug Something isn't working 2 - In Progress Currently a work in progress non-breaking Non-breaking change labels Mar 30, 2026
@github-actions github-actions bot added Python Affects Python cuDF API. cudf-polars Issues specific to cudf-polars labels Mar 30, 2026
@GPUtester GPUtester moved this to In Progress in cuDF Python Mar 30, 2026
@rjzamora rjzamora added 3 - Ready for Review Ready for review by team and removed 2 - In Progress Currently a work in progress labels Mar 31, 2026
Comment on lines +55 to 67
chunk = chunk.make_available_and_spill(
self.context.br(), allow_overbooking=True
)
self.allgather.insert(
sequence_number,
PackedData.from_cudf_packed_columns(
pack(
chunk.table_view(),
chunk.stream,
),
chunk.stream,
self.context.br(),
),
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you open an issue in rapidsmpf. If the table is already in packed form (e.g. either spilled or unspilled from packed), this does a copy for no reason.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How about rapidsai/rapidsmpf#933 ? (feel free to correct/close that issue if I misunderstood your meaning here)

@github-actions github-actions bot added the pylibcudf Issues specific to the pylibcudf package label Mar 31, 2026
@rjzamora rjzamora added 5 - Ready to Merge Testing and reviews complete, ready to merge and removed 3 - Ready for Review Ready for review by team labels Apr 1, 2026
@rjzamora
Copy link
Copy Markdown
Member Author

rjzamora commented Apr 1, 2026

/merge

@rapids-bot rapids-bot bot merged commit b7a9d4d into rapidsai:main Apr 1, 2026
109 checks passed
@github-project-automation github-project-automation bot moved this from In Progress to Done in cuDF Python Apr 1, 2026
@rjzamora rjzamora deleted the fix-mem-size-bugs branch April 1, 2026 13:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

5 - Ready to Merge Testing and reviews complete, ready to merge bug Something isn't working cudf-polars Issues specific to cudf-polars non-breaking Non-breaking change pylibcudf Issues specific to the pylibcudf package Python Affects Python cuDF API.

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

3 participants