About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
phat 22(5): e1

Research Article

Augmentation of Predictive Competence of Non-Small Cell Lung Cancer Datasets through Feature Pre-Processing Techniques

Download377 downloads
Cite
BibTeX Plain Text
  • @ARTICLE{10.4108/eetpht.v8i5.3169,
        author={M. Sumalatha and Latha Parthiban},
        title={Augmentation of Predictive Competence of Non-Small Cell Lung Cancer Datasets through Feature Pre-Processing Techniques},
        journal={EAI Endorsed Transactions on Pervasive Health and Technology},
        volume={8},
        number={5},
        publisher={EAI},
        journal_a={PHAT},
        year={2022},
        month={11},
        keywords={Non-small Cell Lung Cancer, Competency of Prediction, Relevancy Analysis, Regression Analysis, Cluster Analysis, Feature Pre-Processing (FPP Model), Competency Analytics},
        doi={10.4108/eetpht.v8i5.3169}
    }
    
  • M. Sumalatha
    Latha Parthiban
    Year: 2022
    Augmentation of Predictive Competence of Non-Small Cell Lung Cancer Datasets through Feature Pre-Processing Techniques
    PHAT
    EAI
    DOI: 10.4108/eetpht.v8i5.3169
M. Sumalatha1,*, Latha Parthiban2
  • 1: Periyar University
  • 2: Pondicherry University
*Contact email: latha7sumaphd@gmail.com

Abstract

The major Objective of the Study is to augment the predictive analytics of Non-Small Cell Lung Cancer (NSCLC) datasets with Feature Pre-Processing (FPP) technique in three stages viz. Remove base errors with common analytics on emptiness or non-numerical or missing values in the dataset, remove repeated features through regression analysis and eliminate irrelevant features through clustering methods. The FPP Model is validated using classifiers like simple and complex Tree, Linear and Gaussian SVM, Weighted KNN and Boosted Trees in terms of accuracy, sensitivity, specificity, kappa, positive and negative likelihood. The result showed that the NSCLC dataset formed after FPP outperformed the raw NSCLC dataset in all performance levels and showed good augmentation in predictive analytics of NSCLC datasets. The research proved that preprocessing is essential for better prediction of complex medical datasets.

Keywords
Non-small Cell Lung Cancer, Competency of Prediction, Relevancy Analysis, Regression Analysis, Cluster Analysis, Feature Pre-Processing (FPP Model), Competency Analytics
Received
2021-06-23
Accepted
2022-10-10
Published
2022-11-02
Publisher
EAI
http://dx.doi.org/10.4108/eetpht.v8i5.3169

Copyright © 2022 M. Sumalatha et al., licensed to EAI. This is an open access article distributed under the terms of the Creative Commons Attribution license, which permits unlimited use, distribution and reproduction in any medium so long as the original work is cited properly.

EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL