
Research Article
BCTM: A Topic Modeling Method Based on External Information
@INPROCEEDINGS{10.1007/978-3-031-40467-2_6, author={Gang Liu and Taiying Wan and Jinfeng Yu and Kai Zhan and Wei Wang}, title={BCTM: A Topic Modeling Method Based on External Information}, proceedings={Broadband Communications, Networks, and Systems. 13th EAI International Conference, BROADNETS 2022, Virtual Event, March 12-13, 2023 Proceedings}, proceedings_a={BROADNETS}, year={2023}, month={7}, keywords={Topic modeling Word embedding External information}, doi={10.1007/978-3-031-40467-2_6} }
- Gang Liu
Taiying Wan
Jinfeng Yu
Kai Zhan
Wei Wang
Year: 2023
BCTM: A Topic Modeling Method Based on External Information
BROADNETS
Springer
DOI: 10.1007/978-3-031-40467-2_6
Abstract
Topic models are often used as intermediate algorithms for text mining and semantic analysis in natural language processing, and have a wide range of functions. However, most of the existing improvements to the topic model use word embedding to improve the accuracy of text modeling, but ignore the external information in the text. This paper proposes a topic model BCTM (Bi-Concept Topic Model) using the word feature information and concept information. Based on the BTM topic model, BCTM introduces word feature information through word vector technology and concept information based on ConceptNet to optimize topic modeling. The construction method of Bi-Concept pair is proposed. Based on ConceptNet semantic network, and the content of text is enriched with concept information. A more accurate topic distribution is obtained through the improved topic model, at the same time, due to the rich feature information, the model is also superior to the baseline model in short text modeling. The experiments prove that the bilingual topic model proposed in this paper has a good performance in modeling accuracy.