About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
sis 24(3):

Research Article

Deep Learning Approaches for English-Marathi Code-Switched Detection

Download68 downloads
Cite
BibTeX Plain Text
  • @ARTICLE{10.4108/eetsis.3972,
        author={Shreyash Bhimanwar and Onkar Viralekar and Koustubh Anturkar and Ashwini Kulkarni},
        title={Deep Learning Approaches for English-Marathi Code-Switched Detection},
        journal={EAI Endorsed Transactions on Scalable Information Systems},
        volume={11},
        number={3},
        publisher={EAI},
        journal_a={SIS},
        year={2023},
        month={9},
        keywords={Code-Switching, Deep Learning, Log-Mel Spectogram, Long Short-Term Memory, LSTM, Mel Frequency Cepstral Coefficients, MFCC, Neural Network, Perpetual Linear Prediction, PLP, Spoken Language Identification, Speech Recognition},
        doi={10.4108/eetsis.3972}
    }
    
  • Shreyash Bhimanwar
    Onkar Viralekar
    Koustubh Anturkar
    Ashwini Kulkarni
    Year: 2023
    Deep Learning Approaches for English-Marathi Code-Switched Detection
    SIS
    EAI
    DOI: 10.4108/eetsis.3972
Shreyash Bhimanwar1,*, Onkar Viralekar1, Koustubh Anturkar1, Ashwini Kulkarni1
  • 1: COEP Technicological University
*Contact email: bhimanwarss19.extc@coep.ac.in

Abstract

During a conversation, speakers in multilingual societies frequently switch between two or more spoken languages. A linguistic action known as "code-switching" particularly alters or merges two or more languages. The development of software or tools for detecting code-switching has received very little attention. This paper proposes a Deep Learning based methods for detecting code-switched English-Marathi data. These suggested methods can be applied to various applications, including phone call merging, Intelligent AI assistants, Intelligent travelling systems to assist travellers in navigation and reservations, call centres to handle customer service issues, etc. To create a system for code switch detection, our study demonstrates a detailed analysis of extracting several audio features such as the Mel-Spectrogram, Mel-frequency Cepstral Coefficient (MFCC), and Perceptual Linear Predictive coefficients (PLP). Our team's English-Marathi code-switched dataset served as the testing ground for our methodologies. Our model's accuracy was 92.99%, with 40 MFCC coefficients having energy coefficient serving as the zeroth coefficient.

Keywords
Code-Switching, Deep Learning, Log-Mel Spectogram, Long Short-Term Memory, LSTM, Mel Frequency Cepstral Coefficients, MFCC, Neural Network, Perpetual Linear Prediction, PLP, Spoken Language Identification, Speech Recognition
Received
2023-06-03
Accepted
2023-08-30
Published
2023-09-25
Publisher
EAI
http://dx.doi.org/10.4108/eetsis.3972

Copyright © 2023 S. Bhimanwar et al., licensed to EAI. This is an open access article distributed under the terms of the CC BY-NCSA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.

EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL