A Context-Aware Approach for Textual Adversarial Attack through Probability Difference Guided Beam Search

08/17/2022
by Huijun Liu, et al.

Textual adversarial attacks expose the vulnerabilities of text classifiers and can be used to improve their robustness. Existing context-aware methods consider only the gold label probability and use greedy search to find an attack path, which often limits attack efficiency. To tackle these issues, we propose PDBS, a context-aware textual adversarial attack model using Probability Difference guided Beam Search. The probability difference takes all class label probabilities into account, and PDBS uses it to guide the selection of attack paths. In addition, PDBS uses beam search to find a successful attack path, which avoids the limited search space of greedy search. Extensive experiments and human evaluation demonstrate that PDBS outperforms the previous best models on a series of evaluation metrics, especially bringing up to a +19.5 [...] analyses further confirm the efficiency of PDBS.
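
The search idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not give the exact formula for the probability difference, so the sketch assumes it is the gold label probability minus the highest probability among the other classes. The names `probability_difference`, `beam_search_attack`, `toy_predict`, and the `substitutes` table are hypothetical. A context-aware method would generate substitutes from the surrounding context, whereas this sketch uses a fixed lookup table and a toy two-class classifier.

```python
import math
from typing import Callable, Dict, List, Optional, Sequence, Tuple


def probability_difference(probs: Sequence[float], gold: int) -> float:
    """ASSUMED definition: gold probability minus the best other-class probability.

    A negative value means some other class now outranks the gold label.
    """
    best_other = max(p for i, p in enumerate(probs) if i != gold)
    return probs[gold] - best_other


def beam_search_attack(
    tokens: List[str],
    gold: int,
    predict: Callable[[List[str]], Sequence[float]],
    substitutes: Dict[str, List[str]],
    beam_width: int = 3,
    max_steps: int = 5,
) -> Optional[List[str]]:
    """Substitute one word per step, keeping the beam_width candidates
    with the smallest probability difference. Returns the first candidate
    that flips the prediction, or None if no attack path is found."""
    beam: List[Tuple[str, ...]] = [tuple(tokens)]
    seen = {tuple(tokens)}
    for _ in range(max_steps):
        scored: List[Tuple[float, Tuple[str, ...]]] = []
        for cand in beam:
            for pos, word in enumerate(cand):
                for sub in substitutes.get(word, []):
                    new = cand[:pos] + (sub,) + cand[pos + 1:]
                    if new in seen:
                        continue
                    seen.add(new)
                    diff = probability_difference(predict(list(new)), gold)
                    if diff < 0:  # gold label is no longer the top class
                        return list(new)
                    scored.append((diff, new))
        if not scored:
            return None
        scored.sort(key=lambda item: item[0])  # smallest difference first
        beam = [cand for _, cand in scored[:beam_width]]
    return None


def toy_predict(tokens: List[str]) -> List[float]:
    """Hypothetical sentiment classifier: returns [P(negative), P(positive)]."""
    weights = {"loved": 1.5, "liked": 0.5, "tolerated": -1.5,
               "great": 2.0, "good": 1.0, "fine": 0.3, "film": -0.1}
    score = sum(weights.get(t, 0.0) for t in tokens)
    p_pos = 1.0 / (1.0 + math.exp(-score))
    return [1.0 - p_pos, p_pos]


# Hypothetical substitute table standing in for context-aware candidates.
toy_substitutes = {"loved": ["liked", "tolerated"],
                   "great": ["good", "fine"],
                   "movie": ["film"]}

adversarial = beam_search_attack(
    ["loved", "great", "movie"], gold=1,
    predict=toy_predict, substitutes=toy_substitutes)
print(adversarial)
```

Setting `beam_width=1` reduces this sketch to greedy search, which makes it easy to compare the two strategies on the same toy input.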
