Improved condense hierarchy of HDBSCAN #7459

jinsolp · 2025-11-07T02:46:47Z

This PR optimizes the build_condensed_hierarchy of HDBSCAN.
Our previous implementation runs a top-down bfs tree traversal, where the GPU kernel is launched for every level of the tree. This is very slow because the tree is not balanced.

This PR introduces a bottom-up approach by pointer-chasing up to the parent on the CPU using omp threads.
This is much faster without any accuracy loss in the final result.

Table below shows two main parts of our HDBSCAN implementation (build linkage, and condense).
adjusted_rand_score is computed against our implementation using brute force graph build + original GPU condense implementation.

BF + orig : Brute force MR graph build + original top-down GPU condense
NND + orig: nn-descent MR graph build + original top-down GPU condense
BF + new: Brute force MR graph build + new bottom-up CPU condense in this PR
NND + new: nn-descent MR graph build + new bottom-up CPU condense in this PR

jinsolp · 2025-11-07T20:10:12Z

It seems like the number of omp threads affect the performance, but the time doesn't always scale linearly. I believe it depends on what the tree looks like.

I think the number of persistent nodes might matter, because if there are more persistent nodes, each thread is more likely to climb "less levels" of the tree (because it climbs until it runs into a persistent node).
The ratio of (persistent nodes) / (total internal nodes) might have to do with this.

cpp/src/hdbscan/detail/condense.cuh

jinsolp · 2025-11-12T18:33:26Z

Heuristics are determined after investigating that the persistent/internal node ratio does affect the perf of this CPU implementation

csadorf

Review is not yet complete, but here is a first set of comments.

csadorf · 2025-11-19T21:26:50Z

cpp/src/hdbscan/detail/condense.cuh

+
+/* Heuristic dispatching to CPU. A high persistent_ratio means there are more chances to stop early
+as we climb up the tree, making it more efficient for bottom-up CPU approach*/
+bool dispatch_to_cpu(int num_persistent, int n_leaves)


I think there is a very good chance that after we introduce this change, all of our testing either dispatches to CPU or does not dispatch to CPU. We need to ensure that we have a variety of test conditions such that both paths are hit.

cpp/src/hdbscan/detail/condense.cuh

csadorf · 2025-11-19T22:00:03Z

cpp/src/hdbscan/detail/condense.cuh

+  if (persistent_ratio >= 0.001) {
+    return true;
+  } else if (persistent_ratio >= 0.0001 && num_omp_threads >= 16) {
+    return true;
+  } else if (num_omp_threads >= 64) {
+    return true;
+  } else {
+    return false;
+  }
+}


Should this heuristic really be independent of data set size?

This is independent of dataset size because it's branching based on the ratio, not the absolute number

My point is that I suspect that this heuristic should take dataset size into account. That's something we should at least evaluate.

cpp/src/hdbscan/detail/condense.cuh

Still in evaluation stage.

jinsolp · 2025-11-20T17:19:49Z

Changing target branch to main to target 26.02

copy-pr-bot · 2025-11-21T00:06:43Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

jinsolp · 2025-11-21T00:10:03Z

force pushed because of issues while rebasing to the main branch

jinsolp requested a review from a team as a code owner November 7, 2025 02:46

jinsolp requested a review from lowener November 7, 2025 02:46

github-actions bot added the CUDA/C++ label Nov 7, 2025

github-actions bot assigned jinsolp Nov 7, 2025

jinsolp changed the title ~~condense on cpu~~ Optimizing condense hierarchy of HDBSCAN Nov 7, 2025

jinsolp added non-breaking Non-breaking change improvement Improvement / enhancement to an existing function labels Nov 7, 2025

jinsolp changed the title ~~Optimizing condense hierarchy of HDBSCAN~~ Improved condense hierarchy of HDBSCAN Nov 7, 2025

divyegala self-requested a review November 7, 2025 04:28

jinsolp commented Nov 12, 2025

View reviewed changes

cpp/src/hdbscan/detail/condense.cuh Show resolved Hide resolved

jinsolp changed the base branch from main to release/25.12 November 17, 2025 17:03

csadorf requested review from csadorf and removed request for lowener November 17, 2025 17:20

csadorf requested changes Nov 19, 2025

View reviewed changes

divyegala previously approved these changes Nov 20, 2025

View reviewed changes

jinsolp changed the base branch from release/25.12 to main November 20, 2025 17:32

jinsolp force-pushed the opt-condense-hierarchy branch from 78ce1b0 to d914709 Compare November 21, 2025 00:06

jinsolp requested review from a team as code owners November 21, 2025 00:06

jinsolp requested review from jcrist and msarahan November 21, 2025 00:06

github-actions bot added conda conda issue Cython / Python Cython or Python issue ci labels Nov 21, 2025

jinsolp force-pushed the opt-condense-hierarchy branch from d914709 to 1ce68a4 Compare November 21, 2025 00:07

github-actions bot removed conda conda issue Cython / Python Cython or Python issue ci labels Nov 21, 2025

improve condense hierarchy

409acf0

jinsolp force-pushed the opt-condense-hierarchy branch from 1ce68a4 to 409acf0 Compare November 21, 2025 00:09

jinsolp removed request for a team, jcrist and msarahan November 21, 2025 00:09

Improved condense hierarchy of HDBSCAN #7459

Are you sure you want to change the base?

Improved condense hierarchy of HDBSCAN #7459

Conversation

jinsolp commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jinsolp commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

jinsolp commented Nov 12, 2025

Uh oh!

csadorf left a comment

Choose a reason for hiding this comment

Uh oh!

csadorf Nov 19, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

csadorf Nov 19, 2025

Choose a reason for hiding this comment

Uh oh!

jinsolp Nov 20, 2025

Choose a reason for hiding this comment

Uh oh!

csadorf Nov 20, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jinsolp commented Nov 20, 2025

Uh oh!

copy-pr-bot bot commented Nov 21, 2025

Uh oh!

jinsolp commented Nov 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jinsolp commented Nov 7, 2025 •

edited

Loading

jinsolp commented Nov 7, 2025 •

edited

Loading