Signal Processing and Information Technology. First International Joint Conference, SPIT 2011 and IPC 2011, Amsterdam, The Netherlands, December 1-2, 2011, Revised Selected Papers

Research Article

Tamil to Hindi Machine Transliteration Using Support Vector Machines

Download
497 downloads
  • @INPROCEEDINGS{10.1007/978-3-642-32573-1_44,
        author={S. Keerthana and V. Dhanalakshmi and M. Anand Kumar and V. Ajith and K. Soman},
        title={Tamil to Hindi Machine Transliteration Using Support Vector Machines},
        proceedings={Signal Processing and Information Technology. First International Joint Conference, SPIT 2011 and IPC 2011, Amsterdam, The Netherlands, December 1-2, 2011, Revised Selected Papers},
        proceedings_a={SPIT \& IPC},
        year={2012},
        month={10},
        keywords={Named entities Transliteration Phonetic Alphabet Sequence Labeling Support Vector Machines},
        doi={10.1007/978-3-642-32573-1_44}
    }
    
  • S. Keerthana
    V. Dhanalakshmi
    M. Anand Kumar
    V. Ajith
    K. Soman
    Year: 2012
    Tamil to Hindi Machine Transliteration Using Support Vector Machines
    SPIT & IPC
    Springer
    DOI: 10.1007/978-3-642-32573-1_44
S. Keerthana1,*, V. Dhanalakshmi2,*, M. Anand Kumar1,*, V. Ajith1,*, K. Soman1,*
  • 1: Amrita Vishwa Vidyapeetham
  • 2: SRM University
*Contact email: keerthana.keerthi@gmail.com, dhanagiri@gmail.com, mailtoanandkumar@yahoo.co.in, ajith12485@gmail.com, kp_soman@amrita.edu

Abstract

Transliteration is the process of replacing the characters in one language with the corresponding phonetically equivalent characters of the other language. India is a language diversified country where people speak and understand many languages but does not know the script of some of these languages. Transliteration plays a major role in such cases. Transliteration has been a supporting tool in machine translation and cross language information retrieval systems as most of the proper nouns are out of vocabulary words. In this paper, a sequence learning method for transliterating named entities from Tamil to Hindi is proposed. Through this approach, accuracy obtained is encouraging. This transliteration system can be embedded with Tamil to Hindi machine translation system in future.