Chinese Named Entity Recognition Based on Dynamically Adjusting Feature Weights

Qing Lv; Limin Zheng; Miao Wang

Collaborative Computing: Networking, Applications and Worksharing. 17th EAI International Conference, CollaborateCom 2021, Virtual Event, October 16-18, 2021, Proceedings, Part I

Research Article

Chinese Named Entity Recognition Based on Dynamically Adjusting Feature Weights

Download

56 downloads

Cite: BibTeX Plain Text

@INPROCEEDINGS{10.1007/978-3-030-92635-9_1,
    author={Qing Lv and Limin Zheng and Miao Wang},
    title={Chinese Named Entity Recognition Based on Dynamically Adjusting Feature Weights},
    proceedings={Collaborative Computing: Networking, Applications and Worksharing. 17th EAI International Conference, CollaborateCom 2021, Virtual Event, October 16-18, 2021, Proceedings, Part I},
    proceedings_a={COLLABORATECOM},
    year={2022},
    month={1},
    keywords={Named entity recognition Dynamic weight fusion Entity level local CNN BILSTM},
    doi={10.1007/978-3-030-92635-9_1}
}

Qing Lv
Limin Zheng
Miao Wang
Year: 2022
Chinese Named Entity Recognition Based on Dynamically Adjusting Feature Weights
COLLABORATECOM
Springer
DOI: 10.1007/978-3-030-92635-9_1

Qing Lv¹, Limin Zheng¹^,*, Miao Wang¹

1: College of Information and Electrical Engineering

*Contact email: zhenglimin@cau.edu.cn

Abstract

Named entity recognition is a basic task in NLP, and it is an important basic tool for many NLP tasks such as information extraction, parsing, question answering system and machine translation. The extraction of sequence features of datasets directly affects the recognition effect of named entities, and only the accumulation of local sequence features cannot capture the long distance dependencies. The extraction of global sequence features improves this problem, but loses some local features. Long entities are nested within short entities and have different entity attributes from short entities, resulting in identification errors. To solve these problems, a Chinese named entity recognition algorithm based on Bert +FL-LGWF+CRF is proposed. In this method, the text is encoded into a word vector matrix by Bert as the input to FL-LGWF (Entity Level-Local And Global Weighted Fusion). FL-LGWF utilizes CNN (Convolutional Neural) to extract the local sequence features of the text vector, and use BISTM (Bidirectional Long Short-Term Memory) to extract contextual global sequence features, and perform dynamic weight fusion on the extracted sequence features. Then the score matrix of the tag is obtained according to the entity attribute level. Finally, the global optimal tag sequence is obtained through the CRF layer. Experimental results show that the proposed Bert +FL-LGWF+CRF model has higher F1 value on both public data sets and self-created data sets.

Keywords: Named entity recognition, Dynamic weight fusion, Entity level local, CNN, BILSTM

Published: 2022-01-01
Appears in: SpringerLink

: http://dx.doi.org/10.1007/978-3-030-92635-9_1

Chinese Named Entity Recognition Based on Dynamically Adjusting Feature Weights

Abstract

About EAI

Community

Publish with EAI