
Conversation

@Artemii-Shlychkov

Issue

T-distributed Stochastic Neighbor Embedding (t-SNE) is a widely used tool for dimensionality reduction and visualization of high-dimensional datasets. By replacing the Gaussian kernel that SNE uses in the embedding space with a Cauchy kernel (a Student t-distribution with the degrees-of-freedom (dof) parameter alpha set to 1), t-SNE alleviates the “crowding problem” in low-dimensional embeddings. Varying the dof parameter alpha affects t-SNE embeddings on both toy and real-world datasets (e.g., MNIST and single-cell RNA sequencing data). Moreover, alpha can be treated as a trainable parameter and adjusted during embedding optimization with a gradient-based method. Alpha values different from 1 can yield superior embeddings, reflected in lower Kullback-Leibler (KL) divergence and higher k-nearest-neighbors (kNN) recall, at least on some datasets. Overall, this suggests that optimizing alpha can produce more faithful low-dimensional representations of high-dimensional data.
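For reference, the low-dimensional similarities with a general dof can be written with the heavy-tailed kernel w_ij = (1 + ||y_i - y_j||^2 / alpha)^(-(alpha + 1)/2), one common parametrization that reduces to the standard Cauchy kernel at alpha = 1 and is assumed in the sketches below. A minimal NumPy sketch of the exact (non-approximated) similarities, with hypothetical names:

```python
import numpy as np

def heavy_tailed_affinities(Y, alpha=1.0):
    """Exact low-dimensional similarities q_ij for an embedding Y of shape (n, 2).

    Kernel: w_ij = (1 + ||y_i - y_j||^2 / alpha) ** (-(alpha + 1) / 2);
    alpha = 1 recovers the usual Cauchy (Student t, dof = 1) kernel of t-SNE.
    """
    sq_dists = np.sum((Y[:, None, :] - Y[None, :, :]) ** 2, axis=-1)
    W = (1 + sq_dists / alpha) ** (-(alpha + 1) / 2)
    np.fill_diagonal(W, 0)          # exclude self-similarities
    return W / W.sum()              # normalize so the q_ij sum to 1
```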

Description of changes

_tsne.pyx
Main changes with regard to dof optimization (an exact-gradient reference sketch follows this list):

  • estimate_positive_gradient_nn function:
    • added computation of the alpha gradient positive term
  • _estimate_negative_gradient_single function:
    • added computation of the alpha gradient negative term
  • estimate_negative_gradient_bh function:
    • added normalization of the alpha gradient negative term by sum_Q
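For concreteness, here is a brute-force (no Barnes-Hut, no kNN sparsity) reference for the alpha gradient that the changes above approximate, assuming the same kernel parametrization as the sketch in the issue text and a dense, symmetric P normalized to sum to 1; names are hypothetical, not the PR's Cython code:

```python
import numpy as np

def exact_alpha_gradient(P, Y, alpha):
    """Exact dKL/dalpha for w_ij = (1 + d_ij^2 / alpha) ** (-(alpha + 1) / 2).

    The BH code splits this into a positive (attractive) term accumulated over
    the kNN pairs of P and a negative (repulsive) term estimated with the tree
    and normalized by sum_Q; here both are computed exactly over all pairs.
    """
    d2 = np.sum((Y[:, None, :] - Y[None, :, :]) ** 2, axis=-1)
    W = (1 + d2 / alpha) ** (-(alpha + 1) / 2)
    np.fill_diagonal(W, 0)                 # exclude self-pairs
    sum_Q = W.sum()                        # normalization constant

    # d log w_ij / d alpha (zero on the diagonal since d_ii = 0)
    dlogw = (-0.5 * np.log1p(d2 / alpha)
             + (alpha + 1) * d2 / (2 * alpha * (alpha + d2)))

    positive = np.sum(P * dlogw)           # attractive part (sparse kNN graph in practice)
    negative = np.sum(W * dlogw) / sum_Q   # repulsive part, normalized by sum_Q
    return negative - positive             # = sum_ij (q_ij - p_ij) * d log w_ij / d alpha
```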

tsne.py

  • added dataclass OptimizationStats to track KL divergence, dof values, alpha gradient values and embeddings at every iteration
  • kl_divergence_bh function:
    • added dof gradient computation (alpha_grad), based on the outputs of the _tsne module
  • gradient_descent class (a toy end-to-end sketch follows this list):
    • added optional optimize_for_alpha bool argument to trigger dof optimization
    • added optional dof_lr argument (learning rate for the dof updates)
    • added optional dof update using the current alpha_grad value and the dof_lr learning rate
    • added optional eval_error_every_iter argument to make tracking of the KL divergence more flexible
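To illustrate the update scheme described above (plain gradient steps on dof with learning rate dof_lr, alongside the usual position updates), here is a toy exact-gradient loop. It reuses exact_alpha_gradient from the sketch above, uses hypothetical names and defaults, and is not the PR's Barnes-Hut code path:

```python
def toy_joint_descent(P, Y, n_iter=250, lr=100.0, dof_lr=0.5, dof=1.0):
    """Toy gradient descent that updates both the embedding Y and the dof.

    The embedding gets the usual KL gradient step for the heavy-tailed kernel;
    dof gets a plain `dof -= dof_lr * alpha_grad` step each iteration.
    """
    Y = Y.copy()
    for _ in range(n_iter):
        d2 = np.sum((Y[:, None, :] - Y[None, :, :]) ** 2, axis=-1)
        W = (1 + d2 / dof) ** (-(dof + 1) / 2)
        np.fill_diagonal(W, 0)
        Q = W / W.sum()
        # position gradient: dKL/dy_i = 2(dof+1) * sum_j (p_ij - q_ij)(y_i - y_j) / (dof + d_ij^2)
        coef = (P - Q) / (dof + d2)
        grad = 2 * (dof + 1) * (coef.sum(axis=1)[:, None] * Y - coef @ Y)
        Y -= lr * grad
        dof -= dof_lr * exact_alpha_gradient(P, Y, dof)   # the trainable-dof step
    return Y, dof
```
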
Includes
  • Code changes
  • Tests
  • Documentation

exaggeration=None,
dof=1,
optimize_for_alpha=False,
dof_lr=0.5,
Owner

Can we connect this to the t-SNE learning rate somehow? Don't like the extra parameter

openTSNE/tsne.py Outdated
momentum=0.8,
exaggeration=None,
dof=1,
optimize_for_alpha=False,
Owner

activate with dof="auto" for optimization
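A minimal sketch of how such a sentinel could be handled (hypothetical; the parsing below is not code from this PR):

```python
def resolve_dof(dof):
    """Map the suggested dof="auto" sentinel to (initial_dof, optimize_for_alpha)."""
    if dof == "auto":
        return 1.0, True      # start at the Cauchy kernel and optimize dof from there
    return float(dof), False  # fixed, user-supplied degrees of freedom

dof, optimize_for_alpha = resolve_dof("auto")
```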

openTSNE/tsne.py Outdated
)

if optimize_for_alpha:
    dof -= dof_lr * alpha_grad
Owner

This is a new optimizer. Can we optimize with the existing delta-bar-delta optimizer? Let's look at some loss curves.
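For context, this is the per-parameter delta-bar-delta update typically used for the embedding coordinates in t-SNE gradient descent; reusing it for the scalar dof, as suggested, would look roughly like this (a sketch with hypothetical names, not the repository's actual code):

```python
import numpy as np

def delta_bar_delta_step(param, grad, update, gains,
                         learning_rate, momentum=0.8, min_gain=0.01):
    """One delta-bar-delta step: gains grow additively while the gradient keeps
    pointing against the running update (steady progress) and shrink
    multiplicatively when the signs agree (oscillation)."""
    increase = np.sign(grad) != np.sign(update)
    gains = np.where(increase, gains + 0.2, gains * 0.8)
    gains = np.maximum(gains, min_gain)
    update = momentum * update - learning_rate * gains * grad
    return param + update, update, gains

# Applied to the scalar dof (state starts at sensible defaults):
dof, dof_update, dof_gains = 1.0, 0.0, 1.0
alpha_grad = 0.05  # placeholder value; in practice returned by kl_divergence_bh
dof, dof_update, dof_gains = delta_bar_delta_step(
    dof, alpha_grad, dof_update, dof_gains, learning_rate=0.5
)
```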
