About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
sis 24(4):

Research Article

A Novel Ensemble Model for Complex Entities Identification in Low Resource Language

Download104 downloads
Cite
BibTeX Plain Text
  • @ARTICLE{10.4108/eetsis.4434,
        author={Preeti Vats and Nonita Sharma and Deepak Kumar Sharma},
        title={A Novel Ensemble Model for Complex Entities Identification in Low Resource Language},
        journal={EAI Endorsed Transactions on Scalable Information Systems},
        volume={11},
        number={4},
        publisher={EAI},
        journal_a={SIS},
        year={2023},
        month={11},
        keywords={NLP, Ensemble learning, Decision Tree, Hindi Text Identification},
        doi={10.4108/eetsis.4434}
    }
    
  • Preeti Vats
    Nonita Sharma
    Deepak Kumar Sharma
    Year: 2023
    A Novel Ensemble Model for Complex Entities Identification in Low Resource Language
    SIS
    EAI
    DOI: 10.4108/eetsis.4434
Preeti Vats1,*, Nonita Sharma1, Deepak Kumar Sharma1
  • 1: Indira Gandhi Delhi Technical University for Women
*Contact email: preeti017phdit22@igdtuw.ac.in

Abstract

The fundamental method for pre-processing speech or text data that enables computers to comprehend human language is known as natural language processing. Numerous models have been developed to date to pre-process data in the English language; however, the Hindi language does not support these models. India's national tongue is Hindi. In order to help the locals, the authors of this study used supervised learning methods like Linear Regression, SVM, and Naive Bayes algorithm to investigate a dataset of complicated terms in the Hindi language. Additionally, a sophisticated Hindi word classification model is suggested employing several methods based on the forecasts as well as collective learning strategies like Random Forest, Adaboost, and Decision Tree. Depending on how well the user's language is understood, the suggested model will assist in simplifying Hindi text. Authors attempt to classify the uncharted dataset using deep learning algorithms like Bi-LSTM and GRU approaches in further processing.

Keywords
NLP, Ensemble learning, Decision Tree, Hindi Text Identification
Received
2023-10-04
Accepted
2023-11-11
Published
2023-11-21
Publisher
EAI
http://dx.doi.org/10.4108/eetsis.4434

Copyright © 2023 P. Vats et al., licensed to EAI. This is an open-access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.

EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL