About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part II

Research Article

AI-Model For Probabilistic Assessment and Classification of Harmful Digital Content

Download7 downloads
Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.4108/eai.28-4-2025.2357981,
        author={Ujjawal  Kumar and V.  Kalpana and Nirmal  M and Ashish  Ranjan},
        title={AI-Model For Probabilistic Assessment and Classification of Harmful Digital Content},
        proceedings={Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part II},
        publisher={EAI},
        proceedings_a={ICITSM PART II},
        year={2025},
        month={10},
        keywords={harmful digital content multi-label classifi- cation logistic regression bert natural language process- ing jigsaw dataset automated content moderation adaptive thresholding ai-based classification},
        doi={10.4108/eai.28-4-2025.2357981}
    }
    
  • Ujjawal Kumar
    V. Kalpana
    Nirmal M
    Ashish Ranjan
    Year: 2025
    AI-Model For Probabilistic Assessment and Classification of Harmful Digital Content
    ICITSM PART II
    EAI
    DOI: 10.4108/eai.28-4-2025.2357981
Ujjawal Kumar1,*, V. Kalpana1, Nirmal M1, Ashish Ranjan1
  • 1: Vel Tech Rangarajan Dr. Sagunthala R & D Institute of Science and Technology
*Contact email: ujjawalsah9801@gmail.com

Abstract

In today’s digital landscape, the sheer volume of harmful content necessitates smarter, more adaptable automated moderation techniques. This research introduces a novel, two- pronged AI approach designed for the probabilistic identification and categorization of harmful text, such as toxic comments, insults, threats, and obscenity. By thoughtfully integrating the speed and interpretability of logistic regression with the advanced contextual understanding of fine-tuned BERT models on the challenging Jigsaw dataset, we aim to overcome limitations inherent in traditional content moderation methods. Our method- ology uniquely incorporates a dynamic thresholding strategy, employing a grid search optimized for the F1 score, to achieve a more nuanced and balanced classification performance tailored to the specific characteristics of each harmful content category. Our experimental findings demonstrate a significant ability of this combined model to effectively differentiate between safe and harmful online communication, showcasing a promising and adaptive solution for real-time content moderation. Ultimately, this work highlights the substantial potential of intelligently blending machine learning classification techniques within digital platforms to cultivate more secure and inclusive online environ- ments for everyone.

Keywords
harmful digital content, multi-label classifi- cation, logistic regression, bert, natural language process- ing, jigsaw dataset, automated content moderation, adaptive thresholding, ai-based classification
Published
2025-10-14
Publisher
EAI
http://dx.doi.org/10.4108/eai.28-4-2025.2357981
Copyright © 2025–2025 EAI
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL