One pixel attack for fooling deep neural networks

10/24/2017
by Jiawei Su, et al.

Recent research has revealed that the output of Deep Neural Networks (DNNs) can be easily altered by adding relatively small perturbations to the input vector. In this paper, we analyze an attack in an extremely limited scenario where only one pixel can be modified. To that end, we propose a novel method for generating one-pixel adversarial perturbations based on differential evolution. It requires less adversarial information and can fool more types of networks. The results show that 70.97% of the natural images can be perturbed to at least one target class by modifying just one pixel with 97.47% confidence on average. Thus, the proposed attack explores a different take on adversarial machine learning in an extremely limited scenario, showing that current DNNs are also vulnerable to such low-dimension attacks.
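For illustration, the following is a minimal sketch of how such a one-pixel attack can be framed as a differential-evolution search over a single pixel's coordinates and RGB values. It is not the authors' implementation: the classifier interface (predict_probs), the assumption that images are H x W x 3 arrays, and the DE hyperparameters are all illustrative, and SciPy's general-purpose differential_evolution stands in for the paper's own optimizer configuration.

    from scipy.optimize import differential_evolution

    def one_pixel_attack(image, true_label, predict_probs, img_size=32):
        # A candidate solution encodes one pixel: (x, y, r, g, b).
        def apply_pixel(candidate, img):
            x, y, r, g, b = candidate
            perturbed = img.copy()  # image assumed to be an H x W x 3 array
            perturbed[int(x), int(y)] = (int(r), int(g), int(b))
            return perturbed

        def objective(candidate):
            # Untargeted attack: minimize the probability of the true class.
            return predict_probs(apply_pixel(candidate, image))[true_label]

        bounds = [(0, img_size - 1), (0, img_size - 1),  # pixel coordinates
                  (0, 255), (0, 255), (0, 255)]          # RGB values
        result = differential_evolution(objective, bounds,
                                        maxiter=75, popsize=10,
                                        recombination=1, seed=0)
        return apply_pixel(result.x, image), result.fun

    # Hypothetical usage, where model_predict returns a probability vector:
    # adversarial_img, remaining_confidence = one_pixel_attack(img, label, model_predict)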
