About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
sis 23(5):

Research Article

Classification model for student dropouts using machine learning: A case study

Download528 downloads
Cite
BibTeX Plain Text
  • @ARTICLE{10.4108/eetsis.vi.3455,
        author={Henry Villarreal-Torres and Julio \^{A}ngeles-Morales and William Mar\^{\i}n-Rodriguez and Daniel Andrade-Gir\^{o}n and Jenny Cano-Mej\^{\i}a and Carmen Mej\^{\i}a-Murillo and Gumercindo Flores-Reyes and Manuel Palomino-M\^{a}rquez},
        title={Classification model for student dropouts using machine learning: A case study},
        journal={EAI Endorsed Transactions on Scalable Information Systems},
        volume={10},
        number={5},
        publisher={EAI},
        journal_a={SIS},
        year={2023},
        month={9},
        keywords={autoML, machine learning, Student dropout, higher education, H2O.ai, data mining},
        doi={10.4108/eetsis.vi.3455}
    }
    
  • Henry Villarreal-Torres
    Julio Ángeles-Morales
    William Marín-Rodriguez
    Daniel Andrade-Girón
    Jenny Cano-Mejía
    Carmen Mejía-Murillo
    Gumercindo Flores-Reyes
    Manuel Palomino-Márquez
    Year: 2023
    Classification model for student dropouts using machine learning: A case study
    SIS
    EAI
    DOI: 10.4108/eetsis.vi.3455
Henry Villarreal-Torres1, Julio Ángeles-Morales1, William Marín-Rodriguez2,*, Daniel Andrade-Girón2, Jenny Cano-Mejía1, Carmen Mejía-Murillo1, Gumercindo Flores-Reyes1, Manuel Palomino-Márquez1
  • 1: Universidad San Pedro
  • 2: Universidad Nacional José Faustino Sánchez Carrión
*Contact email: wmarin@unjfsc.edu.pe

Abstract

Information and communication technologies have been fulfilling a highly relevant role in the different fields of knowledge, addressing problems in various disciplines; there is an increased capacity to identify patterns and anomalies in an organization's data using data mining; In this context, the study aimed to develop a classification model for student dropout, applying machine learning with the autoML method of the H2O.ai framework; the dimensionality of the socioeconomic and academic characteristics has been taken into account, with the purpose that the directors make reasonable decisions to counteract the abandonment of the students in the study programs. The methodology used was of a technological type, purposeful level, incremental innovation, temporal scope, and synchronous; data collection was prospective. For this, a 20-item questionnaire was applied to 237 students enrolled in the master's degree programs in the education of the Graduate School. The research resulted in a supervised machine learning model, Gradient Reinforcement Machine (GBM), to classify student dropout, thus identifying the main associated factors that influence dropout, obtaining a Gini coefficient of 92.20%, AUC of 96.10% and a LogLoss of 24.24% representing a model with efficient performance.

Keywords
autoML, machine learning, Student dropout, higher education, H2O.ai, data mining
Received
2023-09-05
Accepted
2023-09-05
Published
2023-09-05
Publisher
EAI
http://dx.doi.org/10.4108/eetsis.vi.3455

Copyright © 2023 Villarreal-Torres et al., licensed to EAI. This is an open access article distributed under the terms of the CC BYNC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.

EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL