Research Article
Automatic Voice Activity Detection in Different Speech Applications
@INPROCEEDINGS{10.4108/e-forensics.2008.2781, author={Marko Tuononen and Rosa Gonzalez Hautamaki and Pasi Fr\aa{}nti}, title={Automatic Voice Activity Detection in Different Speech Applications}, proceedings={1st International ICST Conference on Forensic Applications and Techniques in Telecommunications, Information and Multimedia}, publisher={ACM}, proceedings_a={E-FORENSICS}, year={2010}, month={5}, keywords={Voice activity detection speech applicatons unsupervised learning voice biometric and speech recognition}, doi={10.4108/e-forensics.2008.2781} }
- Marko Tuononen
Rosa Gonzalez Hautamaki
Pasi Fränti
Year: 2010
Automatic Voice Activity Detection in Different Speech Applications
E-FORENSICS
ACM
DOI: 10.4108/e-forensics.2008.2781
Abstract
This paper presents performance evaluation of voice activity detectors (VAD) by long-term spectral divergence and simple energy-based scheme. Evaluation is made in the terms of false accept (FA) and false reject (FR) errors using four different types of materials, recorded under different transfer channels, scenarios and conditions. Performance of VADs is considered for forensics, speaker recognition and interactive speech dialogue applications. Performance is still far from perfect, but despite the numerous classification errors of the methods tested, especially with noisy data, the methods can be still useful.
Copyright © 2008–2024 ICST