About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
sis 25(5):

Research Article

Life Expectancy Prediction using Recursive Partitioning and Bagging Algorithms

Download6 downloads
Cite
BibTeX Plain Text
  • @ARTICLE{10.4108/eetsis.8959,
        author={Muhammad Bux Alvi and Majdah Alvi and Yasir Hussain  and Wajid Rehman and Kavita Tabassum  and Shahnawaz Farhan  and Fatima Noor },
        title={Life Expectancy Prediction using Recursive Partitioning and Bagging Algorithms},
        journal={EAI Endorsed Transactions on Scalable Information Systems},
        volume={12},
        number={5},
        publisher={EAI},
        journal_a={SIS},
        year={2025},
        month={10},
        keywords={Decision Tree, Life Expectancy, Random Forest algorithm, Public Health Policy, Social Sustainability},
        doi={10.4108/eetsis.8959}
    }
    
  • Muhammad Bux Alvi
    Majdah Alvi
    Yasir Hussain
    Wajid Rehman
    Kavita Tabassum
    Shahnawaz Farhan
    Fatima Noor
    Year: 2025
    Life Expectancy Prediction using Recursive Partitioning and Bagging Algorithms
    SIS
    EAI
    DOI: 10.4108/eetsis.8959
Muhammad Bux Alvi1,*, Majdah Alvi1, Yasir Hussain 2, Wajid Rehman3, Kavita Tabassum 4, Shahnawaz Farhan 5, Fatima Noor 1
  • 1: Islamia University of Bahawalpur
  • 2: Numrex
  • 3: University of Engineering and Technology Lahore
  • 4: Sindh Agriculture University
  • 5: Sindh Energy Department, Pakistan
*Contact email: mbalvi@iub.edu.pk

Abstract

Life expectancy is a crucial indicator of the population’s health and well-being. Recent research has highlighted the importance of various socioeconomic and health factors in determining the lifespan of individuals. Those factors include Gross Domestic Product (GDP), healthcare expenditure, mortality rates, and education level. This study employs recursive partitioning (decision trees) and bagging (random forest) techniques on the Life Expectancy dataset from the World Health Organization (WHO) to evaluate the effectiveness of predictive models. The dataset was prepared by encoding categorical features, scaling the features, normalizing them, and handling outliers. Mean imputation was used to handle missing values and produce a quality dataset. Optimized models based on recursive partitioning and bagging algorithms achieved performance efficiencies of 92% and 97%, respectively. The bagging algorithm-based model produced a mean squared error of 1.17, a mean absolute error of 2.0, and an R2-score of 97%. Other key findings included the importance of dataset characteristics—such as HIV/AIDS prevalence, adult mortality, and health resource income—in predicting life expectancy. This research elucidates the impact of feature engineering and data preprocessing strategies on data quality and predictive model precision, offering novel insights for public health policymaking and informing future research directions.

Keywords
Decision Tree, Life Expectancy, Random Forest algorithm, Public Health Policy, Social Sustainability
Received
2025-03-24
Accepted
2025-10-09
Published
2025-10-13
Publisher
EAI
http://dx.doi.org/10.4108/eetsis.8959

Copyright © 2025 Muhammad Bux Alvi et al., licensed to EAI. This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.

EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL