
Research Article
Multitask Sentiment Analysis and Topic Classification Using BERT
@ARTICLE{10.4108/eetsis.5287, author={Parita Shah and Hiren Patel and Priya Swaminarayan}, title={Multitask Sentiment Analysis and Topic Classification Using BERT}, journal={EAI Endorsed Transactions on Scalable Information Systems}, volume={12}, number={1}, publisher={EAI}, journal_a={SIS}, year={2024}, month={7}, keywords={BERT, Analyzing Sentiments, Categorizing Topics, Multitasking in Learning, Processing Natural Language, Machine Learning Techniques, News Dataset, Retrieval Information}, doi={10.4108/eetsis.5287} }
- Parita Shah
Hiren Patel
Priya Swaminarayan
Year: 2024
Multitask Sentiment Analysis and Topic Classification Using BERT
SIS
EAI
DOI: 10.4108/eetsis.5287
Abstract
In this study, a multitask model is proposed to perform simultaneous news category and sentiment classification of a diverse dataset comprising 3263 news records spanning across eight categories, including environment, health, education, tech, sports, business, lifestyle, and science. Leveraging the power of Bidirectional Encoder Representations from Transformers (BERT), the algorithm demonstrates remarkable results in both tasks. For topic classification, it achieves an accuracy of 98% along with balanced precision and recall, substantiating its proficiency in categorizing news articles. For sentiment analysis, the model maintains strong accuracy at 94%, distinguishing positive from negative sentiment effectively. This multitask approach showcases the model's versatility and its potential to comprehensively understand and classify news articles based on content and sentiment. This multitask model not only enhances classification accuracy but also improves the efficiency of handling extensive news datasets. Consequently, it empowers news agencies, content recommendation systems, and information retrieval services to offer more personalized and pertinent content to their users.
Copyright © 2024 P. Shah et al., licensed to EAI. This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.