Emerging Technologies for Developing Countries. 5th EAI International Conference, AFRICATEK 2022, Bloemfontein, South Africa, December 5-7, 2022, Proceedings

Research Article

Reinforcement Learning in Education: A Multi-armed Bandit Approach

Cite
  • @INPROCEEDINGS{10.1007/978-3-031-35883-8_1,
        author={Herkulaas MvE Combrink and Vukosi Marivate and Benjamin Rosman},
        title={Reinforcement Learning in Education: A Multi-armed Bandit Approach},
        proceedings={Emerging Technologies for Developing Countries. 5th EAI International Conference, AFRICATEK 2022, Bloemfontein, South Africa, December 5-7, 2022, Proceedings},
        proceedings_a={AFRICATEK},
        year={2023},
        month={7},
        keywords={Autonomous Learning Education Reinforcement Learning Multi-Armed Bandits},
        doi={10.1007/978-3-031-35883-8_1}
    }
    
  • Herkulaas MvE Combrink
    Vukosi Marivate
    Benjamin Rosman
    Year: 2023
    Reinforcement Learning in Education: A Multi-armed Bandit Approach
    AFRICATEK
    Springer
    DOI: 10.1007/978-3-031-35883-8_1
Herkulaas MvE Combrink1,*, Vukosi Marivate1, Benjamin Rosman2
  • 1: Department of Computer Science
  • 2: School of Computer Science and Applied Mathematics
*Contact email: u29191051@tuks.co.za

Abstract

Advances in reinforcement learning research have demonstrated the ways in which different agent-based models can learn how to optimally perform a task within a given environment. Reinforcement learning solves unsupervised problems where agents move through a state-action-reward loop to maximize the overall reward for the agent, which in turn optimizes the solving of a specific problem in a given environment. However, these algorithms are designed based on our understanding of actions that should be taken in a real-world environment to solve a specific problem. One such problem is the ability to identify, recommend and execute an action within a system where the users are the subject, such as in education. In recent years, the use of blended learning approaches, which integrate face-to-face learning with online learning in the education context, has increased. Additionally, online platforms used for education require the automation of certain functions, such as the identification, recommendation or execution of actions that can benefit the user, who in this context is the student or learner. As promising as these scientific advances are, there is still a need to conduct research in a variety of different areas to ensure the successful deployment of these agents within education systems. Therefore, the aim of this study was to contextualise and simulate the cumulative reward within an environment for an intervention recommendation problem in the education context.
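
To make the idea of simulating cumulative reward in a multi-armed bandit concrete, the following is a minimal sketch and not the authors' implementation. It assumes an epsilon-greedy agent, Bernoulli (success or failure) rewards, and a small set of hypothetical interventions. The abstract does not specify the algorithm, the reward model, or the interventions, so the function name `simulate_bandit`, its parameters, and the success probabilities are all illustrative assumptions.

```python
import random


def simulate_bandit(success_probs, steps=1000, epsilon=0.1, seed=0):
    """Simulate an epsilon-greedy agent on a Bernoulli multi-armed bandit.

    Each arm stands for one hypothetical intervention; success_probs[i] is
    the assumed (made-up) probability that intervention i helps the learner.
    Returns the running cumulative reward, pull counts, and value estimates.
    """
    rng = random.Random(seed)          # local generator for reproducible runs
    n_arms = len(success_probs)
    counts = [0] * n_arms              # how often each arm was chosen
    values = [0.0] * n_arms            # running mean reward per arm
    cumulative = []                    # cumulative reward after each step
    total = 0.0

    for _ in range(steps):
        # Explore with probability epsilon, otherwise exploit the best estimate.
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)
        else:
            arm = max(range(n_arms), key=lambda a: values[a])

        # Bernoulli reward: 1 if the intervention "worked", else 0.
        reward = 1.0 if rng.random() < success_probs[arm] else 0.0

        # Incremental update of the chosen arm's mean reward.
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]

        total += reward
        cumulative.append(total)

    return cumulative, counts, values


if __name__ == "__main__":
    # Three hypothetical interventions with assumed success probabilities.
    curve, pulls, estimates = simulate_bandit([0.2, 0.5, 0.7], steps=2000)
    print("final cumulative reward:", curve[-1])
    print("pulls per intervention:", pulls)
```

Plotting the returned `cumulative` list against the step index gives a cumulative reward curve. Other bandit strategies, such as UCB or Thompson sampling, could replace the action-selection step without changing the rest of the loop.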

Keywords
Autonomous Learning, Education, Reinforcement Learning, Multi-Armed Bandits
Published
2023-07-06
Appears in
SpringerLink
http://dx.doi.org/10.1007/978-3-031-35883-8_1