This paper introduces an attack-agnostic defense method using convolutional sparse coding to enhance robustness against adversarial examples. It addresses the growing security concerns in deep learning due to adversarial attacks, offering a universal defense strategy. The method leverages dictionary learning and convolutional filters to create a shared projection space for clean and adversarial images. Experimental results on CIFAR-10 and ImageNet datasets demonstrate the effectiveness of the proposed defense approach in preserving image quality while enhancing robustness.
Outline • Background • Motivation and Goal • Method • Experiments • Conclusion
Background • Deep learning has made groundbreaking achievements on … • However, security problems are becoming more and more serious.
Background • Adversarial examples are inputs to machine learning models that an attacker has intentionally designed to cause the model to make a mistake. I. J. Goodfellow, J. Shlens, and C. Szegedy. Explaining and harnessing adversarial examples. CoRR, abs/1412.6572, 2014.
Background Two sources: (a) Abnormal: outside the normal data manifold (our focus), e.g., bird → people, dog → fish, ship → bird. (b) Ambiguous: obfuscated labels, e.g., 0 or 6? bird or bicycle? 4 or 6? In either case the perturbation is unobvious to a human observer.
Background Common adversarial attacks: • FGSM: Fast Gradient Sign Method • BIM: Basic Iterative Method • DeepFool • C&W [Figure: a clean image alongside its FGSM, BIM, DeepFool, and C&W adversarial versions]
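To make these attacks concrete, here is a minimal FGSM sketch in PyTorch; the `model` argument, the epsilon value, and the [0, 1] pixel range are illustrative assumptions, not settings from the paper.

```python
import torch
import torch.nn.functional as F

def fgsm_attack(model, x, y, epsilon=8 / 255):
    """Fast Gradient Sign Method: one signed-gradient step of size epsilon."""
    x = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x), y)   # loss the attacker wants to increase
    loss.backward()
    x_adv = x + epsilon * x.grad.sign()   # move each pixel against the model
    return x_adv.clamp(0, 1).detach()     # stay in a valid image range
```

BIM iterates this step with a smaller step size while clipping to an epsilon-ball around the clean image; DeepFool and C&W instead search for small perturbations by optimization.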
Background Defense strategies: • Modify network: Adversarial training: add adversarial examples to the training set. • Modify learning strategy: Label smoothing: soft labels. Network distillation. • Modify input image: Input transformation: crop and rescale, denoising, compression (see the sketch below). Projection: generative models.
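As an illustration of the input-transformation family, here is a minimal JPEG re-encoding sketch using Pillow; the quality setting is an arbitrary illustrative choice, not a value from the paper.

```python
import io
from PIL import Image

def jpeg_compress(image: Image.Image, quality: int = 75) -> Image.Image:
    """Re-encode as JPEG so that small adversarial perturbations are washed out."""
    buffer = io.BytesIO()
    image.save(buffer, format="JPEG", quality=quality)
    buffer.seek(0)
    return Image.open(buffer).convert("RGB")
```

Transformations like this are attack-agnostic but lossy, which foreshadows the quality/robustness trade-off discussed in the experiments.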
Outline • Background • Motivation and Goal • Method • Experiments • Conclusion
Motivation • Adversarial images deviate from the natural manifold. • Humans can easily filter out the perturbation. • There must be some shared projection space for clean and adversarial images. • We cannot enumerate all attacks and networks. • A universal defense is necessary.
Goal Universally defend against adversarial examples in a quasi-natural space.
Outline • Background • Motivation and Goal • Method • Experiments • Conclusion
Background Dictionary learning / sparse coding:

$$\min_{\mathbf{D},\,\{\boldsymbol{\alpha}_i\}} \; \sum_{i=1}^{n} \frac{1}{2}\lVert \mathbf{x}_i - \mathbf{D}\boldsymbol{\alpha}_i \rVert_2^2 + \lambda \lVert \boldsymbol{\alpha}_i \rVert_1 \quad \text{s.t.} \quad \lVert \mathbf{d}_j \rVert_2 \le 1 \;\; \forall j$$

J. Mairal, F. Bach, and J. Ponce. Sparse modeling for image and vision processing. Foundations and Trends in Computer Graphics and Vision, 8(2-3):85–283, 2014.
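A minimal patch-based dictionary-learning sketch with scikit-learn follows; the patch size, number of atoms, and sparsity penalty are illustrative choices, not the paper's settings.

```python
import numpy as np
from sklearn.decomposition import MiniBatchDictionaryLearning
from sklearn.feature_extraction.image import extract_patches_2d

image = np.random.rand(64, 64)  # stand-in for a real grayscale image in [0, 1]

# Collect 8x8 patches and remove each patch's mean.
patches = extract_patches_2d(image, (8, 8), max_patches=2000)
X = patches.reshape(len(patches), -1)
X -= X.mean(axis=1, keepdims=True)

# Learn dictionary D and sparse codes alpha so that X is approximately alpha @ D.
dico = MiniBatchDictionaryLearning(n_components=100, alpha=1.0)
alpha = dico.fit_transform(X)  # L1-regularized sparse codes
D = dico.components_           # dictionary atoms, one per row
X_hat = alpha @ D              # sparse reconstruction of the patches
```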
Background Convolutional dictionary learning (CDL):

$$\min_{\{\mathbf{d}_k\},\,\{\mathbf{z}_k\}} \; \frac{1}{2}\Big\lVert \mathbf{x} - \sum_{k=1}^{K} \mathbf{d}_k * \mathbf{z}_k \Big\rVert_2^2 + \lambda \sum_{k=1}^{K} \lVert \mathbf{z}_k \rVert_1$$

where $\mathbf{d}_k$ are the (pre-learned) dictionary filters and $\mathbf{z}_k$ are the feature maps.
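Below is a minimal ISTA sketch of convolutional sparse coding inference in PyTorch, assuming pre-learned filters `D`; the step size, sparsity weight, and iteration count are illustrative, and this generic projection is only a sketch of the idea behind the paper's sparse transformation layer, not the authors' exact layer.

```python
import torch
import torch.nn.functional as F

def soft_threshold(z, t):
    """Proximal operator of the L1 norm."""
    return z.sign() * (z.abs() - t).clamp(min=0)

def csc_project(x, D, lam=0.1, step=0.01, n_iter=100):
    """Minimize 0.5*||x - sum_k d_k * z_k||^2 + lam*||z||_1 over z via ISTA,
    then return the sparse reconstruction (the quasi-natural projection).

    x: (1, C, H, W) image; D: (K, C, h, w) pre-learned dictionary filters.
    """
    K, _, h, w = D.shape
    _, _, H, W = x.shape
    z = torch.zeros(1, K, H - h + 1, W - w + 1, dtype=x.dtype, device=x.device)
    for _ in range(n_iter):
        x_hat = F.conv_transpose2d(z, D)   # sum_k d_k * z_k
        grad = F.conv2d(x_hat - x, D)      # adjoint: correlate residual with filters
        z = soft_threshold(z - step * grad, step * lam)
    return F.conv_transpose2d(z, D)

# Defense: classify csc_project(x_adv, D) instead of x_adv, so perturbation
# components outside the span of the learned filters are discarded.
```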
Outline • Background • Motivation and Goal • Method • Experiments • Conclusion
Experiments CIFAR-10 (resolution: 32×32): D. Meng and H. Chen. MagNet: A two-pronged defense against adversarial examples. In CCS, 2017. Y. Song, T. Kim, S. Nowozin, S. Ermon, and N. Kushman. PixelDefend: Leveraging generative models to understand and defend against adversarial examples. In ICLR, 2018.
Experiments ImageNet (resolution: 224×224):
Experiments ImageNet (resolution: 224×224): C. Guo, M. Rana, M. Cisse, and L. van der Maaten. Countering adversarial images using input transformations. In ICLR, 2018. S.-M. Moosavi-Dezfooli, A. Shrivastava, and O. Tuzel. Divide, denoise, and defend against adversarial attacks. arXiv preprint arXiv:1802.06806, 2018. A. Prakash, N. Moran, S. Garber, A. DiLillo, and J. Storer. Deflecting adversarial attacks with pixel deflection. In CVPR, 2018.
Experiments There is an intrinsic trade-off between image reconstruction quality and defense robustness: a sparser projection removes more of the adversarial perturbation but also discards more image detail.
Outline • Background • Motivation and Goal • Method • Experiments • Conclusion
Conclusion • We propose a novel attack-agnostic adversarial defense method based on convolutional sparse coding. • We design a novel sparse transformation layer (STL) that projects inputs into a low-dimensional quasi-natural space. • We evaluate the proposed method on CIFAR-10 and ImageNet and show that our defense mechanism provides state-of-the-art results. • We also analyze the trade-off between projected image quality and defense robustness.