About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
IoT Technologies for Health Care. 8th EAI International Conference, HealthyIoT 2021, Virtual Event, November 24-26, 2021, Proceedings

Research Article

A Comparative Study of Data Mining Techniques Applied to Renal-Cell Carcinomas

Download(Requires a free EAI acccount)
3 downloads
Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1007/978-3-030-99197-5_5,
        author={Ana Duarte and Hugo Peixoto and Jos\^{e} Machado},
        title={A Comparative Study of Data Mining Techniques Applied to Renal-Cell Carcinomas},
        proceedings={IoT Technologies for Health Care. 8th EAI International Conference, HealthyIoT 2021, Virtual Event, November 24-26, 2021, Proceedings},
        proceedings_a={HEALTHYIOT},
        year={2022},
        month={3},
        keywords={Renal-Cell Carcinoma Data Mining Survival Life expectancy RapidMiner},
        doi={10.1007/978-3-030-99197-5_5}
    }
    
  • Ana Duarte
    Hugo Peixoto
    José Machado
    Year: 2022
    A Comparative Study of Data Mining Techniques Applied to Renal-Cell Carcinomas
    HEALTHYIOT
    Springer
    DOI: 10.1007/978-3-030-99197-5_5
Ana Duarte1, Hugo Peixoto2,*, José Machado2
  • 1: University of Minho
  • 2: Centro Algoritmi, University of Minho
*Contact email: hpeixoto@di.uminho.pt

Abstract

Despite being one of the deadliest diseases and the enormous evolution in fighting it, the best methods to predict kidney cancer, namely Renal-Cell Carcinomas (RCC), are not well-known. One of the solutions to accelerate the current knowledge about RCC is through the use of Data Mining techniques based on patients' personal and clinical data. Therefore, it is crucial to understand which techniques are the most suitable to extract knowledge about this disease. In this paper, we followed the CRISP-DM methodology to simulate different techniques to determine the ones with the best predictive performance. For this purpose, we used a dataset of 821 records of RCC patients, obtained from The Cancer Genome Atlas. The present work tests different Data Mining techniques, that can be used to predict the 5-year life expectancy of patients with renal cancer and to predict the number of days to death for patients who have a life expectancy of less than 5 years. The results obtained demonstrated that the best algorithm for estimating the vital status at 5 years was Random Forest. This algorithm presented an accuracy of 87.65% and an AUROC of 0.931. For the prediction of days to death, the best performance was obtained with the k-Nearest Neighbors algorithm with a root mean square error of 354.6 days. The work suggested that Data Mining techniques can help to understand the influence of various risk factors on the life expectancy of patients with RCC.

Keywords
Renal-Cell Carcinoma Data Mining Survival Life expectancy RapidMiner
Published
2022-03-23
Appears in
SpringerLink
http://dx.doi.org/10.1007/978-3-030-99197-5_5
Copyright © 2021–2025 ICST
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL