
Research Article
Optimizing Q-Learning for Dynamic Pricing in Airbnb
@INPROCEEDINGS{10.4108/eai.28-4-2025.2357857, author={Chitra M and S Sangeetha and Anshidha Anish and Sruthika R}, title={Optimizing Q-Learning for Dynamic Pricing in Airbnb}, proceedings={Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part I}, publisher={EAI}, proceedings_a={ICITSM PART I}, year={2025}, month={10}, keywords={dynamic pricing q-learning airbnb multi- agent systems reinforcement learning}, doi={10.4108/eai.28-4-2025.2357857} }
- Chitra M
S Sangeetha
Anshidha Anish
Sruthika R
Year: 2025
Optimizing Q-Learning for Dynamic Pricing in Airbnb
ICITSM PART I
EAI
DOI: 10.4108/eai.28-4-2025.2357857
Abstract
Dynamic pricing is the lifeblood of revenue generation in online marketplaces like Airbnb. Reinforcement learning (RL), in particular Q-learning has been extensively investigated in this problem, but the problem becomes challenging since the state space is huge and the learning dynamics are unstable. This work contributes with a novel, multi-agent Q learning, optimization framework for dynamic pricing that additionally combines adaptive open loop exploration-exploitation strategies and economic based reward functions. To show the efficiency of the proposed model we compare with a simple rule-based pricing rule. Experimental results indicate that our Q-learning model generates higher revenues as well as higher dwelling yield with a higher pricing comparatively to heuristics, the difference between revenues of two methods being statistically significant.