Artificial Intelligence for Communications and Networks. Second EAI International Conference, AICON 2020, Virtual Event, December 19-20, 2020, Proceedings

Research Article

An Improved Generation Method of Adversarial Example to Deceive NLP Deep Learning Classifiers

Download
83 downloads
  • @INPROCEEDINGS{10.1007/978-3-030-69066-3_5,
        author={Fangzhou Yuan and Tianyi Zhang and Xin Liang and Peihang Li and Hongzheng Wang and Mingfeng Lu},
        title={An Improved Generation Method of Adversarial Example to Deceive NLP Deep Learning Classifiers},
        proceedings={Artificial Intelligence for Communications and Networks. Second EAI International Conference, AICON 2020, Virtual Event, December 19-20, 2020, Proceedings},
        proceedings_a={AICON},
        year={2021},
        month={7},
        keywords={Adversarial example Deep learning NLP DeepWordBug algorithm},
        doi={10.1007/978-3-030-69066-3_5}
    }
    
  • Fangzhou Yuan
    Tianyi Zhang
    Xin Liang
    Peihang Li
    Hongzheng Wang
    Mingfeng Lu
    Year: 2021
    An Improved Generation Method of Adversarial Example to Deceive NLP Deep Learning Classifiers
    AICON
    Springer
    DOI: 10.1007/978-3-030-69066-3_5
Fangzhou Yuan1, Tianyi Zhang1, Xin Liang1, Peihang Li1, Hongzheng Wang1, Mingfeng Lu1
  • 1: Beijing Institute of Technology

Abstract

Deep learning has been developed rapidly and widely used over the last decade. However, the concepts of adversarial example and adversarial attack are proposed, that is, adding some perturbations to the input of a deep learning model could easily change the prediction result. Deep learning-based NLP (natural language processing) classification algorithms also have this vulnerability. DeepWordBug algorithm is an advanced algorithm for generating adversarial examples, which can effectively deceive common NLP classification models. However, this algorithm needs to modify too many words to cheat NLP classification models, which limits its applications. In response to the shortcomings of DeepWordBug algorithm, this paper proposes an improving method to improve DeepWordBug. Drawing on the idea of Textfooler algorithm, the improved DeepWordBug adopts the method of dynamically adjusting the number of modified words, limits the maximum number of modified words. The new algorithm greatly reduces the number of words that need to be modified while ensuring the accuracy of NLP classification models as around 30%. It also ensures better practicality while maintaining transferability.