MixDiff

Note: This repository is currently a work in progress!

[PyTorch] Code for the paper - MixDiff: Mixing Natural and Synthetic Images for Robust Self-Supervised Representations, WACV, 2025.

[Link to the Paper (arxiv)]

MixDiff is a self-supervised learning (SSL) pre-training framework that leverages both real and synthetic images to enhance representation learning. Unlike traditional SSL methods that rely heavily on real images, MixDiff introduces a novel approach by incorporating a variant of Stable Diffusion to replace an augmented instance of a real image. This enables the model to learn cross real-synthetic representations effectively. Our experiments confirm that MixDiff not only improves performance but also reduces the dependency on large amounts of real data, making it an efficient and versatile framework for SSL.

Comparison of SimCLR performance on real, synthetic (Syn), and mixed real and synthetic images (MixDiff). The radar charts show normalized accuracy across 8 transfer learning datasets (left) and ImageNet-1K plus 6 distribution shift datasets (right), with values from 0.5 to 1.1. MixDiff enhances in-distribution and robustness performance and generalizes better.

Installation

conda create -y -n ffcv-ssl python=3.9 cupy pkg-config compilers libjpeg-turbo opencv pytorch torchvision torchaudio pytorch-cuda=11.7 numba -c pytorch -c nvidia -c conda-forge
conda activate ffcv-ssl
pip install -e .

Troubleshooting note: if the above commands result in a package conflict error, try running conda config --env --set channel_priority flexible in the environment and rerunning the installation command. For detailed installation instructions, please refer to the FFCV library and FFCV-SSL library.

Generating Synthetic Images

To generate images using Stable Diffusion or Versatile Diffusion models from an input dataset, please refer to the generating_images folder.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
configs		configs
ffcv		ffcv
generating_images		generating_images
libffcv		libffcv
modules		modules
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yaml		environment.yaml
image_to_ffcv.py		image_to_ffcv.py
test.py		test.py
train_mixdiff.py		train_mixdiff.py
train_ssl.py		train_ssl.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MixDiff

Installation

Generating Synthetic Images

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MixDiff

Installation

Generating Synthetic Images

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages