Proceedings of the 6th Computer Science Research Days, JRI 2023, 18-20 December 2023, Ouagadougou, Burkina Faso

Research Article

Speech Processing: a literature review

  • @INPROCEEDINGS{10.4108/eai.18-12-2023.2348132,
        author={Go Issa Traore and Borlli Michel Jonas Some},
        title={Speech Processing: a literature review},
        proceedings={Proceedings of the 6th Computer Science Research Days, JRI 2023, 18-20 December 2023, Ouagadougou, Burkina Faso},
        publisher={EAI},
        proceedings_a={JRI},
        year={2024},
        month={6},
        keywords={speech processing, machine learning, speech features extractor, speech database},
        doi={10.4108/eai.18-12-2023.2348132}
    }
    
  • Go Issa Traore
    Borlli Michel Jonas Some
    Year: 2024
    Speech Processing: a literature review
    JRI
    EAI
    DOI: 10.4108/eai.18-12-2023.2348132
Go Issa Traore1,*, Borlli Michel Jonas Some1
  • 1: Université Nazi BONI
*Contact email: goissatraore@yahoo.fr

Abstract

This paper reviews the state of knowledge on speech processing and the research perspectives open in this domain. The literature search was conducted on several digital libraries, including IEEE Xplore, ScienceDirect, arXiv, Springer Link and Papers With Code. It focused on the types of speech classification, the techniques used to extract speech features, the Machine Learning (ML) techniques used and the speech data sources available. We found that studies focus mainly on emotion recognition, dialect identification in speech and speaker recognition. Mel Frequency Cepstral Coefficients (MFCC) are the most widely used technique for speech feature extraction. Neural networks dominate as ML techniques for speech classification. The available speech databases have been built in different contexts, and each is specific to a given language, mainly English, German, Arabic, Chinese or French. There are almost no speech databases for low-resource languages, particularly African languages.