
Generating Less Certain Adversarial Examples Improves Robust Generalization

Code for our paper "Generating Less Certain Adversarial Examples Improves Robust Generalization" by Minxing Zhang, Michael Backes, and Xiao Zhang. In this work, to improve robust generalization, we propose a general method to explicitly Decrease Adversarial Certainty (DAC).


Abstract

This paper revisits the robust overfitting phenomenon of adversarial training. Observing that models with better robust generalization performance are less certain in predicting adversarially generated training inputs, we argue that overconfidence in predicting adversarial examples is a potential cause. Therefore, we hypothesize that generating less certain adversarial examples improves robust generalization, and propose a formal definition of adversarial certainty that captures the variance of the model's predicted logits on adversarial examples. Our theoretical analysis of synthetic distributions characterizes the connection between adversarial certainty and robust generalization. Accordingly, built upon the notion of adversarial certainty, we develop a general method to search for models that can generate training-time adversarial inputs with reduced certainty, while maintaining the model's capability in distinguishing adversarial examples. Extensive experiments on image benchmarks demonstrate that our method effectively learns models with consistently improved robustness and mitigates robust overfitting, confirming the importance of generating less certain adversarial examples for robust generalization.
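
Here, adversarial certainty is measured through the variance of the model's predicted logits on adversarially generated inputs. As a rough, non-authoritative sketch of that quantity (the formal definition and the full DAC training procedure are given in the paper; the function below is purely illustrative), one could compute a per-batch value like this:

import torch

def adversarial_certainty(model, x_adv):
    # x_adv: a batch of adversarially perturbed training inputs
    logits = model(x_adv)                # shape: (batch_size, num_classes)
    # variance of each example's logit vector, averaged over the batch
    return logits.var(dim=1).mean()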


News

Oct. 6, 2023 - We created this repo and our code will be released soon.

Oct. 7, 2023 - We released the code for DAC-AT on CIFAR-10 and the AutoAttack evaluation script.

Oct. 26, 2023 - We released the code for DAC-TRADES and DAC-MART.


Usage

To train DAC-AT on CIFAR-10:

python dac_at_cifar10.py --model {OnWhichModelArchitecture}

To train DAC-TRADES on CIFAR-10:

python dac_trades_cifar10.py --model {OnWhichModelArchitecture}

To train DAC-MART on CIFAR-10:

python dac_mart_cifar10.py --model {OnWhichModelArchitecture}

To evaluate on CIFAR-10:

python eval_autoattack.py --arch {OnWhichModelArchitecture} --data CIFAR10
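
For example, assuming the scripts accept architecture names such as preactresnet18 (this name is an assumption; check the argument parsing in each script for the exact choices), a full training-plus-evaluation run could look like:

python dac_at_cifar10.py --model preactresnet18

python eval_autoattack.py --arch preactresnet18 --data CIFAR10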

Trained Models

The PreActResNet-18 trained by DAC-AT on CIFAR-10: https://drive.google.com/file/d/1xC3kAMY5tHWSF3F2NNHRyPt4FerHYX1l/view?usp=sharing.

The WideResNet-34-10 trained by DAC-AT on CIFAR-10: https://drive.google.com/file/d/1yMm_WGLz53ka6rn0x0SgqNTD9fYJxL-Q/view?usp=sharing.
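
Both checkpoints are PyTorch files. As a minimal loading sketch, assuming each file stores a state_dict and that the repository provides a matching PreActResNet-18 definition (the module name, class name, file name, and saving format below are assumptions; adjust them to the actual code):

import torch
# Hypothetical example: load a downloaded DAC-AT checkpoint for evaluation.
from preactresnet import PreActResNet18  # assumed module/class name

model = PreActResNet18()
state = torch.load('dac_at_preactresnet18_cifar10.pt', map_location='cpu')  # assumed file name
model.load_state_dict(state)
model.eval()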


Reference Code

  1. Robust Overfitting: https://github.com/locuslab/robust_overfitting
  2. TRADES: https://github.com/yaodongyu/TRADES
  3. MART: https://github.com/YisenWang/MART/tree/master
  4. AutoAttack: https://github.com/fra31/auto-attack
