5th International Mobile Multimedia Communications Conference

Research Article

A codebook design method for fricative enhancement in Artificial Bandwidth Extension

  • @INPROCEEDINGS{10.4108/ICST.MOBIMEDIA2009.7423,
        author={Michele Sanna and Maurizio Murroni},
        title={A codebook design method for fricative enhancement in Artificial Bandwidth Extension},
        proceedings={5th International Mobile Multimedia Communications Conference},
        publisher={ICST},
        proceedings_a={MOBIMEDIA},
        year={2010},
        month={5},
        keywords={ABWE artificial bandwidth extension speech enhancement wideband speech fricative},
        doi={10.4108/ICST.MOBIMEDIA2009.7423}
    }
    
  • Michele Sanna
    Maurizio Murroni
    Year: 2010
    A codebook design method for fricative enhancement in Artificial Bandwidth Extension
    MOBIMEDIA
    ICST
    DOI: 10.4108/ICST.MOBIMEDIA2009.7423
Michele Sanna1,*, Maurizio Murroni1,*
  • 1: Dipartimento di Ingegneria Elettrica ed Elettronica, Università degli Studi di Cagliari, Piazza d’Armi 09123, Cagliari, Italy.
*Contact email: michele.sanna@diee.unica.it, murroni@diee.unica.it

Abstract

In mobile communications, transmitted speech signals are narrowband: they are sampled at 8 kHz and therefore lowpass filtered below 4 kHz, so a considerable amount of intelligibility is lost. The goal of Artificial Bandwidth Extension (ABWE) is to recover the lost quality by reconstructing the voice spectrum between 4 and 8 kHz, thus restoring the superior listening quality and intelligibility of wideband speech. An algorithm based on a Hidden Markov Model (HMM) has been shown to be valid for most speech sounds, but it proved quite ineffective in reconstructing fricative consonants. We investigated the causes of the poor extension of fricatives and the problems that result from it. We developed a codebook design technique that places particular emphasis on these sounds in order to improve the fidelity of the reproduction and the dynamics of the processing. Our design noticeably improves the intelligibility of fricatives. Log-spectral distance measures confirm the fidelity of the extension, along with the subjective listening quality and intelligibility.
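
The abstract does not define its log-spectral distance measure, so the following is only a minimal sketch of one common formulation: the root-mean-square difference, in dB, between the power spectrum of a reference wideband frame and that of the bandwidth-extended frame, restricted to the reconstructed 4 to 8 kHz band. The function name, the 16 kHz wideband sampling rate, the one-sided spectrum layout, and the numerical floor are all assumptions made for illustration, not details taken from the paper.

```python
import math


def log_spectral_distance(ref_power, est_power, sample_rate=16000,
                          band=(4000.0, 8000.0), floor=1e-12):
    """RMS log-spectral distance (dB) between two one-sided power spectra.

    Hypothetical helper: both inputs are assumed to hold power values for
    bins spaced evenly from 0 Hz to sample_rate / 2 inclusive.
    Only bins whose frequency lies inside `band` contribute.
    """
    if len(ref_power) != len(est_power):
        raise ValueError("spectra must have the same number of bins")
    n_bins = len(ref_power)
    if n_bins < 2:
        raise ValueError("need at least two frequency bins")

    # Frequency spacing of a one-sided spectrum covering 0 .. sample_rate / 2.
    hz_per_bin = (sample_rate / 2.0) / (n_bins - 1)

    squared_diffs = []
    for k in range(n_bins):
        freq = k * hz_per_bin
        if band[0] <= freq <= band[1]:
            # Clamp to a small floor so log10 never sees zero (assumption).
            ref_db = 10.0 * math.log10(max(ref_power[k], floor))
            est_db = 10.0 * math.log10(max(est_power[k], floor))
            squared_diffs.append((ref_db - est_db) ** 2)

    if not squared_diffs:
        raise ValueError("no frequency bins fall inside the requested band")
    return math.sqrt(sum(squared_diffs) / len(squared_diffs))
```

Under these assumptions, a lower value means the extended band is spectrally closer to the true wideband frame. Per-frame values would typically be averaged over an utterance, and could be reported separately for fricative frames if the aim is to isolate the effect on those sounds.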