GitHub - hkscy/MLPdatatrap-example: An illustrative implementation of an MLP "data trap"

An illustrative implementation of a privacy backdoor "data trap" on a small MNIST MLP model based on Privacy Backdoors: Stealing Data with Corrupted Pretrained Models by Feng and Tramèr.

Was written for an article describing the attack and its limitations which you can find on my blog.

Install

Tested only on macOS 13.7.3 and Python 3.10.14.

git clone https://github.com/hkscy/MLPdatatrap-example.git
cd MLPdatatrap-example
conda create --name datatraps --file requirements.txt

Run

Steps below will download MNIST 10, train an MLP on the dataset, backdoor a copy of the model, and then finetune both corrupted and uncorrupted models using each of SGD and Adam optimisation. Plots of the activations, loss gradients, and weight updates are output for all 4 models along with fine-grained data and the recovered (or not) finetuning data.

conda activate datatraps
python datatrap_plot_sgd_adam.py

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
example_plots		example_plots
.gitignore		.gitignore
README.md		README.md
datatrap_plot_sgd_adam.py		datatrap_plot_sgd_adam.py
neuron_stats.py		neuron_stats.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Install

Run

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

hkscy/MLPdatatrap-example

Folders and files

Latest commit

History

Repository files navigation

Install

Run

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages