Research Article
Word Embedding for Text Classification: Efficient CNN and Bi-GRU Fusion Multi Attention Mechanism
@ARTICLE{10.4108/eetsis.3992,
  author={Yalamanchili Salini and Poluru Eswaraiah and M. Veera Brahmam and Uddagiri Sirisha},
  title={Word Embedding for Text Classification: Efficient CNN and Bi-GRU Fusion Multi Attention Mechanism},
  journal={EAI Endorsed Transactions on Scalable Information Systems},
  volume={10},
  number={6},
  publisher={EAI},
  journal_a={SIS},
  year={2023},
  month={9},
  keywords={Text categorization, Deep learning, Convolution neural network, CNN, Gate recurrent unit, GRU, Attention},
  doi={10.4108/eetsis.3992}
}
Yalamanchili Salini
Poluru Eswaraiah
M. Veera Brahmam
Uddagiri Sirisha
Year: 2023
SIS
EAI
DOI: 10.4108/eetsis.3992
Abstract
This paper proposes a deep learning model for text classification that fuses several attention-based Convolutional Neural Networks (CNNs) with Gated Recurrent Units (GRUs) organized in a recurrent neural network. The Efficient CNN and Bi-GRU Fusion Multi Attention Mechanism combines CNNs and bidirectional Gated Recurrent Units (Bi-GRUs) with multiple attention mechanisms to make word embeddings more effective for text classification. The design extracts both local and global features of the words in a text and uses an attention mechanism to compute how much each word contributes to the classification. By combining CNNs, Bi-GRUs, and multi-attention mechanisms, the fusion model aims to represent text documents more effectively and to capture both local and global contextual information, thereby improving its ability to process and analyze textual data. Combining diverse models can also potentially improve the precision of text categorization. Experiments were conducted on several data sets, including the IMDB film review data set and the THUCNews data set. The results show that the proposed model outperforms previous models based solely on CNN or LSTM, as well as fusion models that integrate these architectures, in terms of accuracy, recall, and F1 score.
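The attention and fusion steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the dot-product scoring, the `query` vector (standing in for a learned context vector), and fusion by concatenation are all assumptions made for illustration, and the small hand-written vectors stand in for CNN (local) and Bi-GRU (global) feature outputs.

```python
import math
from typing import List, Sequence, Tuple


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(
    features: Sequence[Sequence[float]], query: Sequence[float]
) -> Tuple[List[float], List[float]]:
    """Weight each word's feature vector by its attention score.

    features: one vector per word (assumed CNN or Bi-GRU output).
    query: hypothetical context vector, learned in a real model.
    Returns (attention weights, weighted sum of the feature vectors).
    """
    # Assumption: dot-product scoring; the paper may use another form.
    scores = [sum(f * q for f, q in zip(vec, query)) for vec in features]
    weights = softmax(scores)
    dim = len(features[0])
    pooled = [
        sum(w * vec[j] for w, vec in zip(weights, features)) for j in range(dim)
    ]
    return weights, pooled


def fuse(local_vec: Sequence[float], global_vec: Sequence[float]) -> List[float]:
    """Assumption: fuse the two branches by simple concatenation."""
    return list(local_vec) + list(global_vec)


if __name__ == "__main__":
    # Toy stand-ins for per-word features from the two branches.
    local_feats = [[2.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    global_feats = [[0.5, 0.5], [1.0, 0.0], [0.0, 2.0]]
    _, local_vec = attention_pool(local_feats, [1.0, 0.0])
    _, global_vec = attention_pool(global_feats, [0.0, 1.0])
    print(fuse(local_vec, global_vec))
```

In a full model the fused vector would feed a classification layer; here the sketch only shows how attention weights turn per-word features into a single document representation per branch.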
Copyright © 2023 Y. Salini et al., licensed to ICST. This is an open-access article distributed under the terms of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/), which permits unlimited use, distribution and reproduction in any medium so long as the original work is properly cited.