Signal Processing and Information Technology. First International Joint Conference, SPIT 2011 and IPC 2011, Amsterdam, The Netherlands, December 1-2, 2011, Revised Selected Papers

Research Article

Clause Boundary Identification for Tamil Language Using Dependency Parsing

Download
431 downloads
  • @INPROCEEDINGS{10.1007/978-3-642-32573-1_32,
        author={R. Dhivya and V. Dhanalakshmi and M. Anand Kumar and K. Soman},
        title={Clause Boundary Identification for Tamil Language Using Dependency Parsing},
        proceedings={Signal Processing and Information Technology. First International Joint Conference, SPIT 2011 and IPC 2011, Amsterdam, The Netherlands, December 1-2, 2011, Revised Selected Papers},
        proceedings_a={SPIT \& IPC},
        year={2012},
        month={10},
        keywords={Natural Language Processing (NLP) Dependency parser Clause boundary Parts of Speech (POS) Shift reduce parser MALT},
        doi={10.1007/978-3-642-32573-1_32}
    }
    
  • R. Dhivya
    V. Dhanalakshmi
    M. Anand Kumar
    K. Soman
    Year: 2012
    Clause Boundary Identification for Tamil Language Using Dependency Parsing
    SPIT & IPC
    Springer
    DOI: 10.1007/978-3-642-32573-1_32
R. Dhivya1,*, V. Dhanalakshmi2,*, M. Anand Kumar1,*, K. Soman1,*
  • 1: Amrita Vishwa Vidyapeetham
  • 2: SRM University
*Contact email: r.dhivya23@gmail.com, dhanagiri@gmail.com, anandkumar@yahoo.co.in, kp_soman@amrita.edu

Abstract

Clause boundary identification is a very important task in natural language processing. Identifying the clauses in the sentence becomes a tough task if the clauses are embedded inside other clauses in the sentence. In our approach, we use the dependency parser to identify the boundary for the clause. The dependency tag set, contains 11 tags, and is useful for identifying the boundary of the clause along with the identification of the subject and object information of the sentence. The MALT parser is used to get the required information about the sentence.