VSF-Med: A Vulnerability Scoring Framework for Medical Vision-Language Models

VSF-Med is a comprehensive framework designed to systematically evaluate the safety, reliability, and adversarial robustness of Vision-Language Models (Vision LLMs) in clinical imaging applications. This repository contains the implementation of our evaluation pipeline and experimental results.

Overview

VSF-Med uses an ordinal 0-4 scale across eight vulnerability dimensions:

Prompt injection effectiveness
Jailbreak resilience
Potential confidentiality breach
Risk of misinformation
Denial of service resilience
Persistence of attack effects
Safety bypass success
Impact on medical decision support

We apply this framework to ten clinically motivated adversarial scenarios, ranging from contextual prompt injections to image perturbations, using the MIMIC-CXR dataset.

Repository Structure

VSF-Med/
├── src/                           # Source code
│   ├── config/
│   │   └── default_config.yaml    # Default configuration
│   ├── database/
│   │   └── dbschema.sql           # PostgreSQL database schema
│   ├── models/evaluation/
│   │   └── vulnerability_scoring.py  # VSF-Med scoring framework
│   └── utils/
│       ├── database/database_utils.py    # Database interactions
│       ├── perturbations/
│       │   ├── image_perturbations.py    # Visual perturbation methods
│       │   └── text_perturbations.py     # Text attack methods
│       └── visualization/image_utils.py  # Image analysis utilities
├── notebooks/                     # Main experiment notebooks 
│   ├── 01_data_preparation_adversarial_samples.ipynb   # Data preparation and adversarial samples
│   ├── 02_model_evaluation_chexagent_baseline.ipynb    # CheXagent baseline evaluation
│   ├── 03_model_evaluation_chexagent_perturbed.ipynb   # CheXagent with perturbed images
│   ├── 04_model_evaluation_gpt_baseline.ipynb          # GPT-4o baseline evaluation
│   ├── 05_vulnerability_scoring_framework.ipynb        # VSF-Med scoring application
│   ├── 06_model_evaluation_claude.ipynb                # Claude model evaluation
│   ├── 07_benchmarking_models.ipynb                    # Cross-model performance comparison
│   └── 08_analysis_radiologist_comparison.ipynb        # Comparison with radiologists
├── templates/                     # Templates for experiments
│   ├── text_attack_templates.txt          # Text attack patterns
│   ├── visual_perturbation_methods.txt    # Visual attack implementations
│   └── vsf_scoring_rubric.txt             # Vulnerability scoring rubric
└── requirements.txt               # Project dependencies

Evaluation Workflow

Data Preparation: Prepare adversarial samples from the MIMIC-CXR dataset, including 5,000 frontal chest X-ray studies stratified by patient demographics and key pathologies.
Adversarial Variant Generation:
- Text attacks: 18 attack categories with 2-4 expert-curated prompt templates each
- Visual attacks: 6 perturbation methods (Gaussian noise, checkerboard, random arrow overlay, Moiré pattern, steganographic hide, LSB extraction)
Model Evaluation: Evaluate multiple vision-language models on both standard and adversarial inputs:
- CheXagent-8b: Specialized medical imaging model
- GPT-4o: General-purpose multimodal model
- Claude: General-purpose multimodal model
Vulnerability Scoring: Apply the VSF-Med framework to score model outputs across the 8 vulnerability dimensions.
Benchmarking: Compare performance across models to identify strengths and weaknesses.
Clinical Comparison: Compare model outputs with radiologist interpretations to assess clinical impact.

Text Attack Categories

Our framework formalizes 18 different attack categories including:

Prompt Injection
Jailbreak Attempts
Confidentiality Breach
Misinformation Generation
Denial-of-Service
Persistence Attacks
Safety Bypass
Semantic Shift
Omission Attacks
Over-Confidence Induction
And more...

Visual Perturbation Methods

We apply six visual perturbation techniques to test model robustness:

Gaussian noise at various levels
Checkerboard overlays (single or tiled)
Moiré patterns with adjustable frequency
Random arrow artifacts
Steganographic information hiding
LSB-plane extraction

Perturbation parameters are optimized via grid search to balance imperceptibility (SSIM ≥ 0.85) with attack potency.

Installation

# Clone the repository
git clone https://github.com/VSF-Med.git
cd VSF-Med

# Create and activate virtual environment (optional but recommended)
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

Usage

Configuration

Copy and customize the default configuration:

cp src/config/default_config.yaml src/config/my_config.yaml

Edit my_config.yaml to set database credentials, API keys, and data paths.

Running Experiments

The experimental workflow is organized in sequential notebooks:

Data Preparation and Adversarial Sample Generation:

jupyter notebook notebooks/01_data_preparation_adversarial_samples.ipynb

Model Baseline Evaluations:

# CheXagent baseline
jupyter notebook notebooks/02_model_evaluation_chexagent_baseline.ipynb

# GPT-4o baseline
jupyter notebook notebooks/04_model_evaluation_gpt_baseline.ipynb

# Claude baseline
jupyter notebook notebooks/06_model_evaluation_claude.ipynb

Adversarial Testing:

# CheXagent with perturbed images
jupyter notebook notebooks/03_model_evaluation_chexagent_perturbed.ipynb

Vulnerability Scoring and Analysis:

# Apply the VSF-Med framework
jupyter notebook notebooks/05_vulnerability_scoring_framework.ipynb

# Compare across models
jupyter notebook notebooks/07_benchmarking_models.ipynb

# Compare with radiologists
jupyter notebook notebooks/08_analysis_radiologist_comparison.ipynb

Requirements

Python 3.8+
API keys:
- OpenAI API key (for GPT-4o access)
- Anthropic API key (for Claude access)
MIMIC-CXR dataset access
PostgreSQL database
Required Python libraries:
- pandas
- numpy
- sqlalchemy
- psycopg2-binary
- openai
- anthropic
- PIL
- cv2
- matplotlib
- scikit-image
- seaborn
- plotly
- nltk

Distributed Experiment Setup

VSF-Med is designed to support distributed experiments across multiple computers:

Database Setup

The project uses a PostgreSQL database to store questions, model responses, and evaluation results:

Setup PostgreSQL: Install and configure PostgreSQL 13+
Create Database Schema: Run the schema in src/database/dbschema.sql
Configure Connection: Update database connection settings in your config file

Cloud Storage for Images

MIMIC-CXR JPG files can be stored in cloud storage or a central location:

Cloud Options:
- Google Cloud Storage
- AWS S3
- Azure Blob Storage
- Shared network drive
Configuration:
- Update paths.data_dir in your config to point to the mounted path
- Ensure proper authentication to access files
- Use relative paths in the database to remain location-agnostic

Using this distributed approach, you can:

Run experiments from multiple machines
Centralize results in a single database
Avoid duplicating the large MIMIC-CXR dataset
Scale processing across multiple computers

Citation

If you use VSF-Med in your research, please cite our paper:

@misc{Sadanandan2025VSFMed,
  title         = {VSF-Med: A Vulnerability Scoring Framework for Medical Vision-Language Models},
  author        = {Sadanandan, Binesh and Behzadan, Vahid},
  year          = {2025},
  eprint        = {2507.00052},
  archivePrefix = {arXiv},
  primaryClass  = {cs.CV},
  doi           = {10.48550/arXiv.2507.00052},
  url           = {https://arxiv.org/abs/2507.00052},
}

License

This project is licensed under the terms of the included LICENSE file.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.idea		.idea
notebooks		notebooks
src		src
templates		templates
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
index.html		index.html
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VSF-Med: A Vulnerability Scoring Framework for Medical Vision-Language Models

Overview

Repository Structure

Evaluation Workflow

Text Attack Categories

Visual Perturbation Methods

Installation

Usage

Configuration

Running Experiments

Requirements

Distributed Experiment Setup

Database Setup

Cloud Storage for Images

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

UNHSAILLab/VSF-Med

Folders and files

Latest commit

History

Repository files navigation

VSF-Med: A Vulnerability Scoring Framework for Medical Vision-Language Models

Overview

Repository Structure

Evaluation Workflow

Text Attack Categories

Visual Perturbation Methods

Installation

Usage

Configuration

Running Experiments

Requirements

Distributed Experiment Setup

Database Setup

Cloud Storage for Images

Citation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages