Learning Diverse Latent Representations for Improving the Resilience to Adversarial Attacks

06/26/2020
by   Ali Mirzaeian, et al.
0

This paper proposes an ensemble learning model that is resistant to adversarial learning attacks. To build resilience, we proposed a training process where each member learns a radically different latent space. Member models are added one at a time to the ensemble. Each model is trained on data set to improve accuracy, while the loss function is regulated by a reverse knowledge distillation, forcing the new member to learn new features and map to a latent space safely distanced from those of existing members. We have evaluated the reliability and performance of the proposed solution on image classification tasks using CIFAR10 and MNIST datasets and show improved performance compared to the state of the art defense methods

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset