
Conversation

@jinsolp (Contributor) commented Nov 13, 2025

Closes #7143

This PR improves memory usage in UMAP when given a precomputed knn graph.
Previously, a user-provided knn graph would occupy GPU memory throughout the full UMAP pipeline even though it is not needed in the later steps of UMAP.

In this PR, if the user-provided knn graph is in host memory, we keep it there and copy it to the device at the C++ level, which allows better memory management.
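A minimal sketch of the idea at the C++ level (names and signatures below are illustrative, not the actual cuML internals): the host-resident knn graph is copied into temporary device buffers only for the step that consumes it, so the device memory is released as soon as that step finishes instead of living for the whole pipeline.

    #include <raft/core/resources.hpp>
    #include <raft/core/resource/cuda_stream.hpp>
    #include <raft/util/cudart_utils.hpp>
    #include <rmm/device_uvector.hpp>

    // Hypothetical helper: copy the user's host-resident knn graph to the
    // device just before the step that needs it.
    void consume_host_knn_graph(raft::resources const& res,
                                const int64_t* host_knn_indices,
                                const float* host_knn_dists,
                                size_t n_rows,
                                size_t n_neighbors)
    {
      auto stream = raft::resource::get_cuda_stream(res);
      size_t len  = n_rows * n_neighbors;
      rmm::device_uvector<int64_t> d_indices(len, stream);
      rmm::device_uvector<float> d_dists(len, stream);
      raft::copy(d_indices.data(), host_knn_indices, len, stream);
      raft::copy(d_dists.data(), host_knn_dists, len, stream);
      // ... build the fuzzy simplicial set from d_indices / d_dists ...
    }  // device buffers are freed here, not at the end of the UMAP pipeline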

This PR with precomputed knn graph on CPU:

[screenshot: memory usage with this PR]

Before with precomputed knn graph on CPU:

[screenshot: memory usage before this PR]

@jinsolp self-assigned this Nov 13, 2025
@jinsolp requested review from a team as code owners November 13, 2025 03:01
@jinsolp requested review from betatim and hcho3 November 13, 2025 03:01
@github-actions bot added the Cython / Python and CUDA/C++ labels Nov 13, 2025
@jinsolp added the non-breaking and improvement labels Nov 13, 2025
    bool is_device_pointer(const void* ptr) {  // signature reconstructed for context; the diff showed only the body below
      cudaPointerAttributes attr;
      if (cudaPointerGetAttributes(&attr, ptr) != cudaSuccess) {
        cudaGetLastError();  // clear the sticky CUDA error state
        return false;        // Assume host pointer if query fails
      }
      return attr.type == cudaMemoryTypeDevice || attr.type == cudaMemoryTypeManaged;
    }
Contributor
This pattern of checking whether it's a device pointer is being repeated way too much. In cuvs and raft we have abstracted it away into functions such as check_pointer_residency. Can you check if you can use those directly? If you cannot, we should create an issue for this -- ideally we'd have a function in raft that does this.

Contributor Author
Changed to using check_pointer_residency!
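For reference, a minimal sketch of what that switch looks like, assuming raft's check_pointer_residency and pointer_residency from raft/util/cudart_utils.hpp (the wrapper name here is illustrative):

    #include <raft/util/cudart_utils.hpp>

    // check_pointer_residency takes a variadic list of pointers and reports
    // host_only / device_only / host_and_device / mixed, replacing the
    // hand-rolled cudaPointerGetAttributes check above.
    template <typename... Args>
    bool all_device_accessible(const Args*... ptrs)
    {
      auto where = raft::check_pointer_residency(ptrs...);
      return where == raft::pointer_residency::device_only ||
             where == raft::pointer_residency::host_and_device;
    }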

@hcho3 (Contributor) left a comment
LGTM

@jinsolp changed the base branch from main to release/25.12 November 17, 2025 17:03
@csadorf (Contributor) left a comment
LGTM. Just a few minor suggestions for improved wording in the docs.

@jinsolp (Contributor Author) commented Nov 19, 2025

Had to change the default based on the changes in PR #7501.

@jinsolp (Contributor Author) commented Nov 21, 2025

/merge

@rapids-bot bot merged commit 45e220d into rapidsai:release/25.12 Nov 21, 2025
106 checks passed
@jinsolp deleted the opt-umap-precomputed-knn-mem branch November 21, 2025 02:21

Labels

CUDA/C++ · Cython / Python · improvement · non-breaking


Development

Successfully merging this pull request may close these issues.

Improve memory efficiency when using precomputed KNN graph in UMAP

4 participants