About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
Advances of Science and Technology. 7th EAI International Conference, ICAST 2019, Bahir Dar, Ethiopia, August 2–4, 2019, Proceedings

Research Article

Automatic Amharic Part of Speech Tagging (AAPOST): A Comparative Approach Using Bidirectional LSTM and Conditional Random Fields (CRF) Methods

Download601 downloads
Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1007/978-3-030-43690-2_37,
        author={Worku Birhanie and Miriam Butt},
        title={Automatic Amharic Part of Speech Tagging (AAPOST): A Comparative Approach Using Bidirectional LSTM and Conditional Random Fields (CRF) Methods},
        proceedings={Advances of Science and Technology. 7th EAI International Conference, ICAST 2019, Bahir Dar, Ethiopia, August 2--4, 2019, Proceedings},
        proceedings_a={ICAST},
        year={2020},
        month={6},
        keywords={Amharic POS BI-LSTM CRF},
        doi={10.1007/978-3-030-43690-2_37}
    }
    
  • Worku Birhanie
    Miriam Butt
    Year: 2020
    Automatic Amharic Part of Speech Tagging (AAPOST): A Comparative Approach Using Bidirectional LSTM and Conditional Random Fields (CRF) Methods
    ICAST
    Springer
    DOI: 10.1007/978-3-030-43690-2_37
Worku Birhanie1,*, Miriam Butt2,*
  • 1: Bahir Dar University
  • 2: University of Konstanz
*Contact email: workukelem@gmail.com, miriam.butt@uni-konstanz.de

Abstract

Part of speech (POS) tagging is an initial task for many natural language applications. POS tagging for Amharic is in its infancy. This study contributes towards the improvement of Amharic POS tagging by experimenting using Deep Learning and Conditional Random Fields (CRF) approaches. Word embedding is integrated into the system to enhance performance. The model was applied to an Amharic news corpus tagged into 11 major part of speeches and achieved accuracies of 91.12% and 90% for the Bidirectional LSTM and CRF methods respectively. The result shows that the Bidirectional LSTM approach performance is better than the CRF method. More enhancement is expected in the future by increasing the size and diversity of Amharic corpus.

Keywords
Amharic POS BI-LSTM CRF
Published
2020-06-05
Appears in
ACM Digital Library
http://dx.doi.org/10.1007/978-3-030-43690-2_37
Copyright © 2019–2025 ICST
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL