Emerging Technologies for Developing Countries. Second EAI International Conference, AFRICATEK 2018, Cotonou, Benin, May 29–30, 2018, Proceedings

Research Article

UmobiTalk: Ubiquitous Mobile Speech Based Translator for Sesotho Language

Download
281 downloads
  • @INPROCEEDINGS{10.1007/978-3-030-05198-3_9,
        author={John Nyetanyane and Muthoni Masinde},
        title={UmobiTalk: Ubiquitous Mobile Speech Based Translator for Sesotho Language},
        proceedings={Emerging Technologies for Developing Countries. Second EAI International Conference, AFRICATEK 2018, Cotonou, Benin, May 29--30, 2018, Proceedings},
        proceedings_a={AFRICATEK},
        year={2018},
        month={12},
        keywords={UmobiTalk Automatic speech recognition (ASR) Machine translation (MT) Text to speech (TTS) and parallel corpora},
        doi={10.1007/978-3-030-05198-3_9}
    }
    
  • John Nyetanyane
    Muthoni Masinde
    Year: 2018
    UmobiTalk: Ubiquitous Mobile Speech Based Translator for Sesotho Language
    AFRICATEK
    Springer
    DOI: 10.1007/978-3-030-05198-3_9
John Nyetanyane1,*, Muthoni Masinde1,*
  • 1: Central University of Technology, Free State
*Contact email: jnyetanyane@cut.ac.za, muthonimasinde@yahoo.com

Abstract

The need to conserve the under-resourced languages is becoming more urgent as some of them are becoming extinct; natural language processing can be used to redress this. Currently, most initiatives around language processing technologies are focusing on western languages such as English and French, yet resources for such languages are already available. Sesotho language is one of the under-resourced Bantu languages; it is mostly spoken in Free State province of South Africa and in Lesotho. Like other parts of South Africa, Free State has experienced a high number of non-Sesotho speaking migrants from neighboring provinces and countries. Such people are faced with serious language barrier problems especially in the informal settlements where everyone tends to speak only Sesotho. As a solution to this, we developed a parallel corpus that has English as a source and Sesotho as a target language and packaged it in UmobiTalk - Ubiquitous mobile speech based learning translator. UmobiTalk is a mobile-based tool for learning Sesotho for English speakers. The development of this tool was based on the combination of automatic speech recognition, machine translation and speech synthesis. This application will be used as an analysis tool for testing accuracy and speed of the corpus. We present the development, testing and evaluation of UmobiTalk in this paper.