Optimizing Q-Learning for Dynamic Pricing in Airbnb

Chitra M; S Sangeetha; Anshidha Anish; Sruthika R

Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part I

Research Article

Optimizing Q-Learning for Dynamic Pricing in Airbnb

Download10 downloads

Cite: BibTeX Plain Text

@INPROCEEDINGS{10.4108/eai.28-4-2025.2357857,
    author={Chitra  M and S  Sangeetha and Anshidha  Anish and Sruthika  R},
    title={Optimizing Q-Learning for Dynamic Pricing in Airbnb},
    proceedings={Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part I},
    publisher={EAI},
    proceedings_a={ICITSM PART I},
    year={2025},
    month={10},
    keywords={dynamic pricing q-learning airbnb multi- agent systems reinforcement learning},
    doi={10.4108/eai.28-4-2025.2357857}
}

Chitra M
S Sangeetha
Anshidha Anish
Sruthika R
Year: 2025
Optimizing Q-Learning for Dynamic Pricing in Airbnb
ICITSM PART I
EAI
DOI: 10.4108/eai.28-4-2025.2357857

Chitra M¹^,*, S Sangeetha¹, Anshidha Anish¹, Sruthika R¹

1: SRM Institute of Science and Technology

*Contact email: chitram2@srmist.edu.in

Abstract

Dynamic pricing is the lifeblood of revenue generation in online marketplaces like Airbnb. Reinforcement learning (RL), in particular Q-learning has been extensively investigated in this problem, but the problem becomes challenging since the state space is huge and the learning dynamics are unstable. This work contributes with a novel, multi-agent Q learning, optimization framework for dynamic pricing that additionally combines adaptive open loop exploration-exploitation strategies and economic based reward functions. To show the efficiency of the proposed model we compare with a simple rule-based pricing rule. Experimental results indicate that our Q-learning model generates higher revenues as well as higher dwelling yield with a higher pricing comparatively to heuristics, the difference between revenues of two methods being statistically significant.

Keywords: dynamic pricing, q-learning, airbnb, multi- agent systems, reinforcement learning

Published: 2025-10-13
Publisher: EAI

: http://dx.doi.org/10.4108/eai.28-4-2025.2357857

Optimizing Q-Learning for Dynamic Pricing in Airbnb

Abstract

About EAI

Community

Publish with EAI