Text Classification Feature Extraction Method Based on Deep Learning for Unbalanced Data Sets

Li Lin; Shu-xin Guo

Advanced Hybrid Information Processing. 4th EAI International Conference, ADHIP 2020, Binzhou, China, September 26-27, 2020, Proceedings, Part I

Research Article

Text Classification Feature Extraction Method Based on Deep Learning for Unbalanced Data Sets

Download

224 downloads

Cite: BibTeX Plain Text

@INPROCEEDINGS{10.1007/978-3-030-67871-5_29,
    author={Li Lin and Shu-xin Guo},
    title={Text Classification Feature Extraction Method Based on Deep Learning for Unbalanced Data Sets},
    proceedings={Advanced Hybrid Information Processing. 4th EAI International Conference, ADHIP 2020, Binzhou, China, September 26-27, 2020, Proceedings, Part I},
    proceedings_a={ADHIP},
    year={2021},
    month={2},
    keywords={Deep learning Unbalanced data sets Text features Classification and extraction},
    doi={10.1007/978-3-030-67871-5_29}
}

Li Lin
Shu-xin Guo
Year: 2021
Text Classification Feature Extraction Method Based on Deep Learning for Unbalanced Data Sets
ADHIP
Springer
DOI: 10.1007/978-3-030-67871-5_29

Li Lin¹^,*, Shu-xin Guo²

1: School of Computer Engineering, Jimei University
2: Jilin University of Finance and Economics

*Contact email: xd220210@163.com

Abstract

In order to fully realize the classified search of text data information, a text classification feature extraction method for imbalanced data sets based on deep learning is proposed. With the help of trestle automatic encoder and depth confidence network, the preliminary definition of text semantic category conditions is completed, and the text semantic classification processing based on depth learning algorithm is realized. On this basis, pre-processing and debugging of text parameters are implemented, and the dimensionality reduction standards related to the text features of the data set to be extracted are established through the expression of the characteristic behavior. The experimental results show that with the application of the new classification feature extraction method, the number of correctly classified documents starts to increase substantially, which meets the practical application requirements for the classification and search of text data information.

Keywords: Deep learning, Unbalanced data sets, Text features, Classification and extraction

Published: 2021-02-03
Appears in: SpringerLink

: http://dx.doi.org/10.1007/978-3-030-67871-5_29

Text Classification Feature Extraction Method Based on Deep Learning for Unbalanced Data Sets

Abstract

About EAI

Community

Publish with EAI