keywords: deep learning, robustness, adversarial examples, adversarial training, certified defense
This repository is the official implementation of BCP (Box Constraint Propagation) by Sungyoon Lee, Jaewook Lee, and Saerom Park.
BCP is a certified adversarial defense method against l2-norm-bounded (more generally, lp-norm, p>0) perturbations. It propagates a box constraint in addition to the Lipschitz outer bound to obtain a tighter outer bound. The propagation scheme is illustrated below: by introducing the additional box constraint, we can cut off the area below the red line from the Lipschitz outer bound (the elliptic area).
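As a rough illustration of the idea in code, the sketch below propagates both an l2 ball (the Lipschitz bound) and a box (interval bound) through a single affine layer; keeping both, and intersecting them, gives a tighter outer bound than either alone. This is a simplified sketch under assumed variable names, not the repository's implementation:

```python
import torch

def propagate_affine(W, b, c, r, l, u):
    """Propagate an l2 ball B(c, r) and a box [l, u] through x -> W x + b.

    Illustrative sketch only; the actual BCP propagation also handles ReLU
    layers and uses the box constraint to tighten the ball-based bound.
    """
    # Lipschitz (l2-ball) bound: the radius is scaled by the spectral norm of W.
    c_out = W @ c + b
    r_out = r * torch.linalg.matrix_norm(W, ord=2)

    # Box (interval) bound: standard interval arithmetic.
    mid, rad = (l + u) / 2, (u - l) / 2
    mid_out = W @ mid + b
    rad_out = W.abs() @ rad
    l_out, u_out = mid_out - rad_out, mid_out + rad_out

    # BCP keeps both bounds; their intersection is the tighter outer bound.
    return c_out, r_out, l_out, u_out
```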
To install requirements, modify environment.yml and run this command:
conda env create -f environment.yml
To train a certifiably robust model with the proposed method, run one of the following commands (for MNIST or CIFAR-10, respectively):
python train_mnist.py
python train_cifar10.py
See demo_cifar10.ipynb for more details.
The trained model will be saved in ./models/mnist/ or ./models/cifar10/.
You can download pretrained models here:
To evaluate the pretrained model, run:
python evaluate_mnist.py --test_pth pretrained/mnist_save.pth
python evaluate_cifar10.py --test_pth pretrained/cifar10_save.pth
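The PGD^100 numbers reported below are obtained with a 100-step l2-bounded PGD attack. A minimal PyTorch sketch of such an attack is shown here; it is not the repository's evaluation code, and the step size and the clamping of inputs to [0, 1] are assumptions:

```python
import torch
import torch.nn.functional as F

def pgd_l2(model, x, y, eps=36 / 255, steps=100, alpha=None):
    """100-step l2-bounded PGD (PGD^100). Illustrative sketch only."""
    alpha = alpha if alpha is not None else 2.5 * eps / steps  # assumed step size
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(steps):
        loss = F.cross_entropy(model(x + delta), y)
        grad = torch.autograd.grad(loss, delta)[0]
        # Move along the (per-example) normalized gradient direction.
        g_norm = grad.flatten(1).norm(dim=1).clamp_min(1e-12).view(-1, 1, 1, 1)
        delta = delta + alpha * grad / g_norm
        # Project back onto the l2 ball of radius eps around x.
        d_norm = delta.flatten(1).norm(dim=1).clamp_min(1e-12).view(-1, 1, 1, 1)
        delta = (delta * (eps / d_norm).clamp(max=1.0)).detach().requires_grad_(True)
    return (x + delta).clamp(0, 1).detach()
```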
Our model achieves the following performance against l2-perturbations compared to state-of-the-art methods (CAP, LMT, IBP). Here, "Standard" is clean test accuracy and "Verification" is the certified robust accuracy; a sketch of how the latter is computed follows the tables.

[Figure: comparison of methods under l2-perturbation; the vertical line marks ε2=36/255.]
Model name | Standard | PGD^100 | Verification |
---|---|---|---|
BCP | 92.54% | 66.23% | 48.20% |
CAP | 88.39% | 62.25% | 43.95% |
LMT | 86.48% | 53.56% | 40.55% |
Model name | Standard | PGD^100 | Verification |
---|---|---|---|
BCP | 28.76% | 26.64% | 20.08% |
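As noted above, "Verification" is the certified robust accuracy: the fraction of test inputs for which the propagated outer bound proves that no allowed perturbation can change the prediction. A minimal sketch of how such a number could be computed from certified worst-case margins (the margin tensor and its convention are assumptions, not this repository's API):

```python
import torch

def certified_accuracy(worst_case_margins, labels):
    """'Verification' accuracy from certified margins (illustrative sketch).

    worst_case_margins[i, j] is assumed to upper-bound logit_j - logit_{y_i}
    over every perturbation in the l2 ball around input i; sample i is
    certified robust when this bound is negative for every wrong class j.
    """
    m = worst_case_margins.clone()
    m.scatter_(1, labels.view(-1, 1), float("-inf"))  # ignore the true class
    return (m.max(dim=1).values < 0).float().mean().item()
```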
If you find this code or the paper useful, please cite:

@article{lee2020lipschitz,
title={Lipschitz-Certifiable Training with a Tight Outer Bound},
author={Lee, Sungyoon and Lee, Jaewook and Park, Saerom},
journal={Advances in Neural Information Processing Systems},
volume={33},
year={2020}
}