Image Distance (imgdist)

A command-line tool to calculate and compare various perceptual distance metrics between images.

Features

Calculates normalized Euclidean distances in the following color spaces (all scaled to 0-1 range):

YCbCr (with proper channel ranges: Y=219, Cb/Cr=224)
HSV (with circular distance correction for Hue channel)
LAB (perceptual color space)

Additional metrics:

Normalized Mean Delta E (MDE) - average perceptual color difference (0-1 range)
Structural Similarity Index (SSIM) - structural similarity on luminance channel (-1 to 1 range, typically 0-1 in practice)
Multi-Scale Structural Similarity Index (MS-SSIM) - multi-resolution structural similarity (-1 to 1 range, typically 0-1 in practice)

Usage

Basic comparison (2 images)

Compare one image against a reference:

Linux:

./imgdist.py reference.jpg compared.jpg

Windows:

py imgdist.py reference.jpg compared.jpg

Advanced comparison (3 images)

Compare two images side-by-side against the same reference:

./imgdist.py reference.jpg imageA.jpg imageB.jpg

This will display both comparisons in a convenient two-column layout, with the better metric highlighted in green for each comparison.

Options

-s, --ssim: Calculate SSIM score
-m, --ms_ssim: Calculate MS-SSIM score
-g, --gpu: Use GPU for SSIM/MS-SSIM calculations (can use CUDA-enabled PyTorch)

Examples

# Basic comparison with default metrics
./imgdist.py original.png compressed.jpg

# Include SSIM and MS-SSIM
./imgdist.py original.png compressed.jpg -s -m

# Compare two compression methods against original with GPU acceleration
./imgdist.py original.png method1.jpg method2.avif -s -m -g

Windows Setup (Optional)

Set file extension association once to run scripts directly:

assoc .py=Python.File
ftype Python.File="C:\WINDOWS\py.exe" "%L" %*

Then you can run:

imgdist.py [options] [image1] [image2]

Output Format

The script displays results in an easy-to-read format:

Side-by-side comparison when using 3 images
Color-coded results: better values highlighted in green
Organized by color space with clear separators
Average distances computed for each color space
Image dimensions displayed in the header

Understanding the metrics

Distance metrics (YCbCr, HSV, LAB, MDE) are normalized to the 0-1 range:

0 = identical images, 1 = maximum possible difference
Lower is better (more similar images)

The normalization of distance metrics allows for meaningful comparison across different color spaces and image sizes.

Similarity metrics (SSIM, MS-SSIM) have a theoretical range of -1 to 1:

1 = identical images (perfect similarity)
0 = no similarity
-1 = perfect anti-correlation (inverted images)
Higher is better. In practice with real images, values are typically in the 0-1 range.

Prerequisites

This script requires Python 3 and the following libraries:

Installation

Install most dependencies using pip:

pip install numpy Pillow pytorch-msssim

PyTorch Installation

PyTorch installation depends on your system and CUDA support. Use the official command generator to get the correct version:

Visit the PyTorch "Get Started" page

Example installations:

CPU only (no GPU):

pip3 install torch torchvision

With CUDA support (e.g., NVIDIA GTX 1080 Ti with CUDA 12.6):

pip3 install torch torchvision --index-url https://download.pytorch.org/whl/cu126

Note: Check your CUDA version with nvidia-smi and select the appropriate PyTorch build from the official website.

Requirements

Same dimensions: All compared images must have identical dimensions (width × height)
RGB images: Images are converted to RGB internally
Supported formats: Any format supported by PIL/Pillow (JPEG, PNG, BMP, TIFF, WebP, JPEG 2000 etc.). You can do pip3 install pillow_heif to support HEIC and pip3 install pillow-jxl-plugin to support JXL.

Technical Details

Color Space Handling

YCbCr: Uses standard ITU-R BT.601 ranges for normalization
HSV: Applies circular distance for Hue channel (accounts for 0°/360° wrap-around)
LAB: Uses PIL's LAB representation (0-255 scaled from CIE L*a*b*)

SSIM/MS-SSIM Calculation

Computed on grayscale (luminance) channel as per standard definition
GPU acceleration available with -g flag

Performance Tips

Use -g flag with a CUDA-enabled GPU for faster SSIM/MS-SSIM on large images
Images are preprocessed once per execution and reused for all metrics

License

CC0 1.0 Universal

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
images		images
README.md		README.md
imgdist.py		imgdist.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Image Distance (imgdist)

Features

Usage

Basic comparison (2 images)

Advanced comparison (3 images)

Options

Examples

Windows Setup (Optional)

Output Format

Understanding the metrics

Prerequisites

Installation

PyTorch Installation

Example installations:

Requirements

Technical Details

Color Space Handling

SSIM/MS-SSIM Calculation

Performance Tips

License

About

Uh oh!

Releases

Packages

Languages

colemar/imgdist

Folders and files

Latest commit

History

Repository files navigation

Image Distance (imgdist)

Features

Usage

Basic comparison (2 images)

Advanced comparison (3 images)

Options

Examples

Windows Setup (Optional)

Output Format

Understanding the metrics

Prerequisites

Installation

PyTorch Installation

Example installations:

Requirements

Technical Details

Color Space Handling

SSIM/MS-SSIM Calculation

Performance Tips

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages