Proceedings of the First International Conference on Combinatorial and Optimization, ICCAP 2021, December 7-8 2021, Chennai, India

Research Article

Visual Intelligence in Conversational Solutions for Voice Intelligence Security System (VOISS)

  • @INPROCEEDINGS{10.4108/eai.7-12-2021.2314590,
        author={Ilayaraja N and MirruduBashini S and Logesh Kumar, S and Sitaraman Ramachandrula},
        title={Visual Intelligence in Conversational Solutions for Voice Intelligence Security System (VOISS)},
        booktitle={Proceedings of the First International Conference on Combinatorial and Optimization, ICCAP 2021, December 7-8 2021, Chennai, India},
        publisher={EAI},
        proceedings_a={ICCAP},
        year={2021},
        month={12},
        keywords={voice recognition algorithm, mel-frequency cepstral coefficients (MFCC), sampling frequency, vector quantization, k-means clustering, audio data, feature extraction},
        doi={10.4108/eai.7-12-2021.2314590}
    }
    
  • Dr. Ilayaraja N
    MirruduBashini S
    Logesh Kumar S
    Sitaraman Ramachandrula
    Year: 2021
    Visual Intelligence in Conversational Solutions for Voice Intelligence Security System (VOISS)
    ICCAP
    EAI
    DOI: 10.4108/eai.7-12-2021.2314590
Dr. Ilayaraja N1,*, MirruduBashini S1, Logesh Kumar S1, Sitaraman Ramachandrula1
  • 1: PSG College of Technology
*Contact email: nir.mca@psgtech.ac.in

Abstract

This work addresses the accurate authentication of people by voice, using visual intelligence in conversational solutions. The Voice Intelligence Security System (VOISS) is used to improve the accuracy of speech recognition. As people become increasingly comfortable with biometrics, voice authentication is finding wider application across industries, including healthcare, banking, and education. The voice biometrics market was projected to grow at a compound annual growth rate (CAGR) of 19.4% between 2017 and 2021. The human voice is extracted from the video and modelled with audio features. The first step of the authentication process is to detect and extract the voice features of the corresponding voice frames. Finally, the voice features and the video features of the corresponding lip movements are aligned and jointly modelled to characterize and authenticate people. Speech recognition enables a machine to convert incoming speech signals into commands by identifying and comprehending them. Its primary goal is to improve language communication between humans and machines, making it an excellent human-machine interface technology. The development of speech recognition technology encompasses the fundamental principles, methods, and classifications of this technology.
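
The abstract does not spell out the voice-feature pipeline, but the keywords mention frame-level feature extraction, vector quantization, and k-means clustering. The following is a minimal sketch of how those pieces could fit together, not the authors' implementation. It assumes a toy two-dimensional feature per frame (log energy and zero-crossing rate) as a stand-in for MFCCs, and all function names (`split_frames`, `frame_features`, `train_codebook`, `distortion`) as well as the synthetic test tones are hypothetical.

```python
import math
import random


def split_frames(signal, frame_len, hop):
    """Split a list of samples into overlapping frames of frame_len samples."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]


def frame_features(frame):
    """Toy feature vector (log energy, zero-crossing rate); a stand-in for MFCCs."""
    energy = sum(s * s for s in frame) / len(frame)
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    return (math.log(energy + 1e-12), crossings / (len(frame) - 1))


def sq_dist(u, v):
    """Squared Euclidean distance between two feature vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def train_codebook(vectors, k, iters=20, seed=0):
    """Plain k-means; the k centroids serve as the vector-quantization codebook."""
    rng = random.Random(seed)
    centroids = rng.sample(vectors, k)
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for v in vectors:
            nearest = min(range(k), key=lambda j: sq_dist(v, centroids[j]))
            clusters[nearest].append(v)
        for j, members in enumerate(clusters):
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return centroids


def distortion(vectors, codebook):
    """Average squared distance from each vector to its nearest codeword."""
    return sum(min(sq_dist(v, c) for c in codebook) for v in vectors) / len(vectors)


def tone(freq, sample_rate=8000, seconds=0.5):
    """Synthetic sine wave used here in place of recorded speech (assumption)."""
    n = int(sample_rate * seconds)
    return [math.sin(2 * math.pi * freq * t / sample_rate) for t in range(n)]


def features_of(signal):
    """Frame the signal and compute one feature vector per frame."""
    return [frame_features(f) for f in split_frames(signal, frame_len=200, hop=100)]


# "Enroll" one synthetic voice, then compare a matching and a non-matching signal.
enrolled = train_codebook(features_of(tone(200)), k=2)
same = distortion(features_of(tone(200)), enrolled)
other = distortion(features_of(tone(1500)), enrolled)
print(same < other)
```

In this sketch a lower distortion against the enrolled codebook indicates a closer match. A real system would replace the toy features with MFCCs computed from recorded speech and would need a tuned acceptance threshold.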