
Research Article
AI-Model For Probabilistic Assessment and Classification of Harmful Digital Content
@INPROCEEDINGS{10.4108/eai.28-4-2025.2357981, author={Ujjawal Kumar and V. Kalpana and Nirmal M and Ashish Ranjan}, title={AI-Model For Probabilistic Assessment and Classification of Harmful Digital Content}, proceedings={Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part II}, publisher={EAI}, proceedings_a={ICITSM PART II}, year={2025}, month={10}, keywords={harmful digital content multi-label classifi- cation logistic regression bert natural language process- ing jigsaw dataset automated content moderation adaptive thresholding ai-based classification}, doi={10.4108/eai.28-4-2025.2357981} }
- Ujjawal Kumar
V. Kalpana
Nirmal M
Ashish Ranjan
Year: 2025
AI-Model For Probabilistic Assessment and Classification of Harmful Digital Content
ICITSM PART II
EAI
DOI: 10.4108/eai.28-4-2025.2357981
Abstract
In today’s digital landscape, the sheer volume of harmful content necessitates smarter, more adaptable automated moderation techniques. This research introduces a novel, two- pronged AI approach designed for the probabilistic identification and categorization of harmful text, such as toxic comments, insults, threats, and obscenity. By thoughtfully integrating the speed and interpretability of logistic regression with the advanced contextual understanding of fine-tuned BERT models on the challenging Jigsaw dataset, we aim to overcome limitations inherent in traditional content moderation methods. Our method- ology uniquely incorporates a dynamic thresholding strategy, employing a grid search optimized for the F1 score, to achieve a more nuanced and balanced classification performance tailored to the specific characteristics of each harmful content category. Our experimental findings demonstrate a significant ability of this combined model to effectively differentiate between safe and harmful online communication, showcasing a promising and adaptive solution for real-time content moderation. Ultimately, this work highlights the substantial potential of intelligently blending machine learning classification techniques within digital platforms to cultivate more secure and inclusive online environ- ments for everyone.