sis 23(5):

Research Article

Semantic Coherence Analysis of English Texts Based on Sentence Semantic Graphs

Download245 downloads
  • @ARTICLE{10.4108/eetsis.3312,
        author={Nanxiao Deng and Yabing Wang and Guimin Huang and Ya Zhou and Yiqun Li},
        title={Semantic Coherence Analysis of English Texts Based on Sentence Semantic Graphs},
        journal={EAI Endorsed Transactions on Scalable Information Systems},
        volume={10},
        number={5},
        publisher={EAI},
        journal_a={SIS},
        year={2023},
        month={8},
        keywords={english text, semantic coherence theory, sentence semantic graph, VF2 subgraph matching algorithm, frequent subgraph},
        doi={10.4108/eetsis.3312}
    }
    
  • Nanxiao Deng
    Yabing Wang
    Guimin Huang
    Ya Zhou
    Yiqun Li
    Year: 2023
    Semantic Coherence Analysis of English Texts Based on Sentence Semantic Graphs
    SIS
    EAI
    DOI: 10.4108/eetsis.3312
Nanxiao Deng1, Yabing Wang1, Guimin Huang1,*, Ya Zhou1, Yiqun Li1
  • 1: Guilin University of Electronic Technology
*Contact email: sendhuang@126.com

Abstract

With the reform of China's education industry, more and more universities are using computers to conduct examinations. For the automatic correction of essays as subjective questions, existing automatic English text scoring systems suffer from insufficient extraction of coherence information and low accuracy when analysing text coherence. Therefore, this paper proposes an unsupervised semantic coherence analysis model for English texts based on sentence semantic graphs, taking Chinese students' English compositions as the research context. Guided by the semantic coherence theory, the English text is represented as a sentence semantic graph, and an improved VF2 subgraph matching algorithm is used to mine the frequently occurring subgraph patterns in the sentence semantic graph. After that, the set of frequent subgraphs is generated by filtering the subgraph patterns according to their frequencies, and the subgraph frequency of each frequent subgraph is calculated separately. Finally, the distribution characteristics of frequent subgraphs and the semantic values of subgraphs in the sentence semantic graphs are extracted to quantify the overall coherence quality of English texts. The experimental results show that the model proposed in this paper has higher accuracy and practical value compared with the current methods of coherence analysis.