Our experiments are conducted with Python 3.8 and PyTorch 1.8.1.
The required packages follow those of CoOp (for training) and MCM (for evaluation).
This code is built on top of the awesome toolbox Dassl.pytorch, so you need to set up the Dassl environment first. Simply follow the instructions described here to install Dassl as well as PyTorch. After that, run `pip install -r requirements.txt` under `LoCoOp/` to install a few more packages required by CLIP and MCM (do this with the dassl environment activated).
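The steps above can be sketched as the following commands. This is a minimal sketch assuming the standard Dassl.pytorch setup flow; the environment name and the Python/PyTorch versions here are assumptions matching the versions stated above — follow the linked instructions for the authoritative steps.

```shell
# Assumption: standard Dassl.pytorch install flow; adjust versions/CUDA build to your machine.
git clone https://github.com/KaiyangZhou/Dassl.pytorch.git
cd Dassl.pytorch

# Create and activate the environment (name "dassl" is an assumption)
conda create -y -n dassl python=3.8
conda activate dassl

# Install PyTorch 1.8.1 / torchvision 0.9.1 (pick the build matching your CUDA driver)
pip install torch==1.8.1 torchvision==0.9.1

# Install Dassl's dependencies and Dassl itself
pip install -r requirements.txt
python setup.py develop

# With dassl activated, install the extra packages needed by CLIP and MCM
cd ../LoCoOp
pip install -r requirements.txt
```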
Please create a `data` folder and download the following ID and OOD datasets into it.
We use ImageNet-1K as the ID dataset.
- Create a folder named `imagenet/` under the `data` folder.
- Create `images/` under `imagenet/`.
- Download the dataset from the official website and extract the training and validation sets to `$DATA/imagenet/images`.
We use the large-scale OOD datasets iNaturalist, SUN, Places, and Texture curated by Huang et al. 2021. We follow instructions from this repository to download the subsampled datasets.
The overall file structure is as follows:
```
LoCoOp
|-- data
    |-- imagenet
        |-- images/
            |-- train/  # contains 1,000 folders like n01440764, n01443537, etc.
            |-- val/    # contains 1,000 folders like n01440764, n01443537, etc.
    |-- iNaturalist
    |-- SUN
    |-- Places
    |-- Texture
|-- ...
```
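Before training, it can help to sanity-check that the layout matches the tree above. A minimal sketch (the `data` root and folder names follow the structure shown; `check_layout` is a hypothetical helper, not part of this repo):

```shell
# Check that the expected ID/OOD dataset folders exist under the given root.
check_layout() {
  _root="${1:-data}"
  _status=0
  for d in imagenet/images/train imagenet/images/val iNaturalist SUN Places Texture; do
    if [ ! -d "$_root/$d" ]; then
      echo "Missing: $_root/$d"
      _status=1
    fi
  done
  return $_status
}

check_layout data || echo "Fix the dataset layout before training."
```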
## Quick Start
### Code structure
We retain the code structure of SCT, which will be refined in the future; the core code is in `./trainers/sct/` (class `sct(TrainerX)`).
e.g., 1-shot training with ViT-B/16:

```train
CUDA_VISIBLE_DEVICES=0 bash scripts/apt/train.sh data imagenet vit_b16_ep25 end 16 1 False 0.25 200
```
We appreciate the following papers for their open-source code, which this repository is built upon.