ew 20(30): e2

Research Article

Data Driven Prognosis of Cervical Cancer Using Class Balancing and Machine Learning Techniques

Download909 downloads
  • @ARTICLE{10.4108/eai.13-7-2018.164264,
        author={Mamta Arora and Sanjeev Dhawan and Kulvinder Singh},
        title={Data Driven Prognosis of Cervical Cancer Using Class Balancing and Machine Learning Techniques},
        journal={EAI Endorsed Transactions on Energy Web},
        volume={7},
        number={30},
        publisher={EAI},
        journal_a={EW},
        year={2020},
        month={5},
        keywords={Cervical Cancer, Random forest, Support vector machine, K-Nearest Neighbour, Random over-sampling, random undersampling, SMOTE},
        doi={10.4108/eai.13-7-2018.164264}
    }
    
  • Mamta Arora
    Sanjeev Dhawan
    Kulvinder Singh
    Year: 2020
    Data Driven Prognosis of Cervical Cancer Using Class Balancing and Machine Learning Techniques
    EW
    EAI
    DOI: 10.4108/eai.13-7-2018.164264
Mamta Arora1,2,*, Sanjeev Dhawan3, Kulvinder Singh3
  • 1: Ph.D. Research Scholar, Department of Computer Science & Engineering, University Institute of Engineering & Technology (U.I.E.T), Kurukshetra University, Kurukshetra
  • 2: Assistant Professor, Department of Computer Science and Technology, Manav Rachna University, Faridabad
  • 3: Faculty of Computer Science & Engineering, Department of Computer Science & Engineering, University Institute of Engineering & Technology (U.I.E.T), Kurukshetra University, Kurukshetra
*Contact email: imamta.arora@gmail.com

Abstract

INTRODUCTION: With the progression of innovation and its joint effort with health care services, the world has achieved a lot of benefits. AI procedures and machine learning techniques are constantly improving existing statistical methods for better results in the medical field. These improved methods will assist health care providers in providing intelligent medical services.

OBJECTIVES: This Cervical cancer is the fourth most common cancer among the other female cancers. This cancer is preventable with early diagnosis. This reason becomes the motivation of the research work. For efficiently and timely prognosis of cervical cancer require a computer-assisted algorithm.

METHODS: The work demonstrated in this paper contributes to the techniques of machine learning for diagnosing cervical cancer. The machine learning algorithms used in this research are K Nearest Neighbour, Support Vector Machine and Random Forest Tree. These classification algorithms are used with class balancing techniques including undersampling, Oversampling and SMOTE.

RESULTS: The evaluation metrics used for comparative analysis includes accuracy, sensitivity, specificity, negative predicted accuracy, and positive predictive accuracy. The results show the Random Forest algorithm with SMOTE technique delivered more promising results over SVM and KNN for four target variables Schiller, Biopsy, Hinselmann , and Cytology respectively.

CONCLUSION: It is concluded that with the limited amount of data which also suffers from the unbalancing problem the promising results drawn using the proposed model.