An Improved Generation Method of Adversarial Example to Deceive NLP Deep Learning Classifiers

Fangzhou Yuan; Tianyi Zhang; Xin Liang; Peihang Li; Hongzheng Wang; Mingfeng Lu

Artificial Intelligence for Communications and Networks. Second EAI International Conference, AICON 2020, Virtual Event, December 19-20, 2020, Proceedings

Research Article

An Improved Generation Method of Adversarial Example to Deceive NLP Deep Learning Classifiers

Download

170 downloads

Cite: BibTeX Plain Text

@INPROCEEDINGS{10.1007/978-3-030-69066-3_5,
    author={Fangzhou Yuan and Tianyi Zhang and Xin Liang and Peihang Li and Hongzheng Wang and Mingfeng Lu},
    title={An Improved Generation Method of Adversarial Example to Deceive NLP Deep Learning Classifiers},
    proceedings={Artificial Intelligence for Communications and Networks. Second EAI International Conference, AICON 2020, Virtual Event, December 19-20, 2020, Proceedings},
    proceedings_a={AICON},
    year={2021},
    month={7},
    keywords={Adversarial example Deep learning NLP DeepWordBug algorithm},
    doi={10.1007/978-3-030-69066-3_5}
}

Fangzhou Yuan
Tianyi Zhang
Xin Liang
Peihang Li
Hongzheng Wang
Mingfeng Lu
Year: 2021
An Improved Generation Method of Adversarial Example to Deceive NLP Deep Learning Classifiers
AICON
Springer
DOI: 10.1007/978-3-030-69066-3_5

Fangzhou Yuan¹, Tianyi Zhang¹, Xin Liang¹, Peihang Li¹, Hongzheng Wang¹, Mingfeng Lu¹

1: Beijing Institute of Technology

Abstract

Deep learning has been developed rapidly and widely used over the last decade. However, the concepts of adversarial example and adversarial attack are proposed, that is, adding some perturbations to the input of a deep learning model could easily change the prediction result. Deep learning-based NLP (natural language processing) classification algorithms also have this vulnerability. DeepWordBug algorithm is an advanced algorithm for generating adversarial examples, which can effectively deceive common NLP classification models. However, this algorithm needs to modify too many words to cheat NLP classification models, which limits its applications. In response to the shortcomings of DeepWordBug algorithm, this paper proposes an improving method to improve DeepWordBug. Drawing on the idea of Textfooler algorithm, the improved DeepWordBug adopts the method of dynamically adjusting the number of modified words, limits the maximum number of modified words. The new algorithm greatly reduces the number of words that need to be modified while ensuring the accuracy of NLP classification models as around 30%. It also ensures better practicality while maintaining transferability.

Keywords: Adversarial example Deep learning NLP DeepWordBug algorithm

Published: 2021-07-16
Appears in: SpringerLink

: http://dx.doi.org/10.1007/978-3-030-69066-3_5

An Improved Generation Method of Adversarial Example to Deceive NLP Deep Learning Classifiers

Abstract

About EAI

Community

Publish with EAI