About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
Broadband Communications, Networks, and Systems. 13th EAI International Conference, BROADNETS 2022, Virtual Event, March 12-13, 2023 Proceedings

Research Article

BCTM: A Topic Modeling Method Based on External Information

Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1007/978-3-031-40467-2_6,
        author={Gang Liu and Taiying Wan and Jinfeng Yu and Kai Zhan and Wei Wang},
        title={BCTM: A Topic Modeling Method Based on External Information},
        proceedings={Broadband Communications, Networks, and Systems. 13th EAI International Conference, BROADNETS 2022, Virtual Event, March 12-13, 2023 Proceedings},
        proceedings_a={BROADNETS},
        year={2023},
        month={7},
        keywords={Topic modeling Word embedding External information},
        doi={10.1007/978-3-031-40467-2_6}
    }
    
  • Gang Liu
    Taiying Wan
    Jinfeng Yu
    Kai Zhan
    Wei Wang
    Year: 2023
    BCTM: A Topic Modeling Method Based on External Information
    BROADNETS
    Springer
    DOI: 10.1007/978-3-031-40467-2_6
Gang Liu1, Taiying Wan1,*, Jinfeng Yu1, Kai Zhan2, Wei Wang1
  • 1: College of Computer Science and Technology, Harbin Engineering University
  • 2: PwC Enterprise Digital, PricewaterhouseCoopers, Sydney
*Contact email: wantaiying@hrbeu.edu.cn

Abstract

Topic models are often used as intermediate algorithms for text mining and semantic analysis in natural language processing, and have a wide range of functions. However, most of the existing improvements to the topic model use word embedding to improve the accuracy of text modeling, but ignore the external information in the text. This paper proposes a topic model BCTM (Bi-Concept Topic Model) using the word feature information and concept information. Based on the BTM topic model, BCTM introduces word feature information through word vector technology and concept information based on ConceptNet to optimize topic modeling. The construction method of Bi-Concept pair is proposed. Based on ConceptNet semantic network, and the content of text is enriched with concept information. A more accurate topic distribution is obtained through the improved topic model, at the same time, due to the rich feature information, the model is also superior to the baseline model in short text modeling. The experiments prove that the bilingual topic model proposed in this paper has a good performance in modeling accuracy.

Keywords
Topic modeling Word embedding External information
Published
2023-07-30
Appears in
SpringerLink
http://dx.doi.org/10.1007/978-3-031-40467-2_6
Copyright © 2023–2025 ICST
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL