About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part I

Research Article

A Comprehensive Review of Generative AI Techniques for Tamil Natural Language Processing

Download8 downloads
Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.4108/eai.28-4-2025.2357753,
        author={Kiruthika  S S and Nalinipriya  G},
        title={A Comprehensive Review of Generative AI Techniques for Tamil Natural Language Processing},
        proceedings={Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part I},
        publisher={EAI},
        proceedings_a={ICITSM PART I},
        year={2025},
        month={10},
        keywords={generative ai nlp llms llama 2 lora qlora multilingual translation domain-specific nlp bias mitigation cultural sensitivity},
        doi={10.4108/eai.28-4-2025.2357753}
    }
    
  • Kiruthika S S
    Nalinipriya G
    Year: 2025
    A Comprehensive Review of Generative AI Techniques for Tamil Natural Language Processing
    ICITSM PART I
    EAI
    DOI: 10.4108/eai.28-4-2025.2357753
Kiruthika S S1,*, Nalinipriya G1
  • 1: Saveetha Engineering College
*Contact email: sskiruthicse@gmail.com

Abstract

Generative text AI revolutionized NLP by producing massive new advances in low-resource languages. To attain better efficiency at a low computational cost, this works has dedicated to raise the fine-tuning process of largescale language models, such as LLaMA 2, with the integration of techniques like Low-Rank Adaptation (LoRA) and Quantized LoRA (QLoRA). The large amount of code-switching that occurs in CoNLL-17’s Dravidian languages such as Tamil, a low resource, multilingual language, has made it an interesting case to tackle, given the emphasis on issues such as: dialectal variation, code-switching and data scarcity. A model-based vocabulary, a domain-specific dataset, and appropriate metrics have resulted in better performance from generative AI. Content creation for the Tamil language has benefited from utilization of the generative tools, namely cross-lingual adaptation and model combination.

Keywords
generative ai, nlp, llms, llama 2, lora, qlora, multilingual translation, domain-specific nlp, bias mitigation, cultural sensitivity
Published
2025-10-13
Publisher
EAI
http://dx.doi.org/10.4108/eai.28-4-2025.2357753
Copyright © 2025–2025 EAI
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL