Improve graph feature extraction and add utilities for visualization and embedding search by swagat-mishra28 · Pull Request #8 · humanai-foundation/ArtExtract

swagat-mishra28 · 2026-03-07T21:50:31Z

Hi,

While exploring the graph construction and embedding pipeline, I made a few small improvements that may help with inspecting the graph representations and working with the generated embeddings.

Main changes include:

• extended the node feature extraction in extract_node() (in utils/build_graph.py) to include additional region statistics and a simple texture descriptor derived from gradient magnitude
• added a small utility visualize_segments() in utils/build_graph.py to visualize the SLIC superpixel segmentation used for graph construction
• added find_similar_embeddings() in embedding.py to perform cosine-similarity search over the generated embeddings, along with a helper to retrieve similar images from the embedding space
• added save_overlay() in utils/visualization.py to optionally save heatmap overlays when inspecting hidden-art visualizations
• added a small dataset integrity check using Image.open(path).verify() to skip corrupted images during dataset construction
• included a few minor stability improvements in the embedding pipeline (handling NaNs, saving embedding metadata, and small usability tweaks)

These additions are mainly intended to make it easier to inspect graph construction, debug segmentation, and experiment with the embeddings produced by the encoder when analyzing paintings.

Please let me know if any of these utilities should be structured differently or integrated elsewhere in the workflow.

… dataset loader

…tools

swagat-mishra28 added 4 commits March 7, 2026 23:13

Improve inference robustness and embedding handling

7065519

Fix duplicate load_inference_datasets definition and remove tqdm from…

1d104a2

… dataset loader

Add configurable checkpoint path and validation in inference pipeline

4f53162

Improve graph representation, embedding utilities, and visualization …

47662ac

…tools

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve graph feature extraction and add utilities for visualization and embedding search#8

Improve graph feature extraction and add utilities for visualization and embedding search#8
swagat-mishra28 wants to merge 4 commits intohumanai-foundation:mainfrom
swagat-mishra28:improve-graph-embedding-pipeline

swagat-mishra28 commented Mar 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

swagat-mishra28 commented Mar 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant