Skip to content

Official implementation of "Provable Defense against Privacy Leakage in Federated Learning from Representation Perspective"

License

Notifications You must be signed in to change notification settings

shuklayash10/Soteria

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Provable-Defense-against-Privacy-Leakage-in-Federated-Learning-from-Representation-Perspective

Official implementation of "Provable Defense against Privacy Leakage in Federated Learning from Representation Perspective"

The paper can be found at https://arxiv.org/pdf/2012.06043.pdf

Abstract

Federated learning (FL) is a popular distributed learning framework that can reduce privacy risks by not explicitly sharing private data. %It can reduce privacy risks. However, recent works have demonstrated that sharing model updates makes FL vulnerable to inference attack. In this work, we show our key observation that the data representation leakage from gradients is the essential cause of privacy leakage in FL. We also provide an analysis of this observation to explain how the data presentation is leaked. Based on this observation, we propose a defense against model inversion attack in FL. The key idea of our defense is learning to perturb data representation such that the quality of the reconstructed data is severely degraded, while FL performance is maintained. In addition, we derive certified robustness guarantee to FL and convergence guarantee to FedAvg after applying our defense. To evaluate our defense, we conduct experiments on MNIST and CIFAR10 for defending against the DLG attack and GS attack. Without sacrificing accuracy, the results demonstrate that our proposed defense can increase the mean squared error between the reconstructed data and the raw data by as much as 160$\times$ for both DLG attack and GS attack, compared with baseline defense methods. Therefore, the privacy of the FL system is significantly improved.

show
Comparing our defense with Gradient Compression defense under GS attack

Code

We provide the implementation of our defense against DLG attack and GS attack. Our code is developed based on DLG original repo and GS original repo.

Setup

pytorch=1.2.0
torchvision=0.4.0

Quick start

DLG attack

For DLG attack, you can change the pruning rate of our defense by changing the percentile parameter in

thresh = np.percentile(deviation_f1_x_norm_sum.flatten().cpu().numpy(), 1)

We also provide the implementation of model compression defense. You can uncomment the corresponding code to try it.

GS attack

For GS attack, you can reproduce the results of the car image in the paper by running

python reconstruct_image.py --target_id=-1 --defense=ours --pruning_rate=60 --save_image

You can try model compression defense by running

python reconstruct_image.py --target_id=-1 --defense=prune --pruning_rate=60 --save_image

Remark

Considering computing efficiency, we use ||r||/||d(f(r))/dX|| to approximate ||r(d(f(r))/dX)^-1|| in the code. You can edit the code to compute ||r(d(f(r))/dX)^-1|| directly, then you can achieve better defense results with higher computation cost.

About

Official implementation of "Provable Defense against Privacy Leakage in Federated Learning from Representation Perspective"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 75.1%
  • Python 24.9%