Research Article
UmobiTalk: Ubiquitous Mobile Speech Based Translator for Sesotho Language
@INPROCEEDINGS{10.1007/978-3-030-05198-3_9, author={John Nyetanyane and Muthoni Masinde}, title={UmobiTalk: Ubiquitous Mobile Speech Based Translator for Sesotho Language}, proceedings={Emerging Technologies for Developing Countries. Second EAI International Conference, AFRICATEK 2018, Cotonou, Benin, May 29--30, 2018, Proceedings}, proceedings_a={AFRICATEK}, year={2018}, month={12}, keywords={UmobiTalk Automatic speech recognition (ASR) Machine translation (MT) Text to speech (TTS) and parallel corpora}, doi={10.1007/978-3-030-05198-3_9} }
- John Nyetanyane
Muthoni Masinde
Year: 2018
UmobiTalk: Ubiquitous Mobile Speech Based Translator for Sesotho Language
AFRICATEK
Springer
DOI: 10.1007/978-3-030-05198-3_9
Abstract
The need to conserve the under-resourced languages is becoming more urgent as some of them are becoming extinct; natural language processing can be used to redress this. Currently, most initiatives around language processing technologies are focusing on western languages such as English and French, yet resources for such languages are already available. Sesotho language is one of the under-resourced Bantu languages; it is mostly spoken in Free State province of South Africa and in Lesotho. Like other parts of South Africa, Free State has experienced a high number of non-Sesotho speaking migrants from neighboring provinces and countries. Such people are faced with serious language barrier problems especially in the informal settlements where everyone tends to speak only Sesotho. As a solution to this, we developed a parallel corpus that has English as a source and Sesotho as a target language and packaged it in UmobiTalk - Ubiquitous mobile speech based learning translator. UmobiTalk is a mobile-based tool for learning Sesotho for English speakers. The development of this tool was based on the combination of automatic speech recognition, machine translation and speech synthesis. This application will be used as an analysis tool for testing accuracy and speed of the corpus. We present the development, testing and evaluation of UmobiTalk in this paper.