3rd International ICST Conference on Broadband Communications, Networks, and Systems

Research Article

Reinforcement Learning Based Path Selection and Wavelength Selection in Optical Burst Switched Networks

  • @INPROCEEDINGS{10.1109/BROADNETS.2006.4374401,
        author={Y. V Kiran and T.  Venkatesh and C. Siva Ram  Murthy},
        title={Reinforcement Learning Based Path Selection and Wavelength Selection in Optical Burst Switched Networks},
        proceedings={3rd International ICST Conference on Broadband Communications, Networks, and Systems},
        publisher={IEEE},
        proceedings_a={BROADNETS},
        year={2006},
        month={10},
        keywords={},
        doi={10.1109/BROADNETS.2006.4374401}
    }
    
  • Y. V Kiran
    T. Venkatesh
    C. Siva Ram Murthy
    Year: 2006
    Reinforcement Learning Based Path Selection and Wavelength Selection in Optical Burst Switched Networks
    BROADNETS
    IEEE
    DOI: 10.1109/BROADNETS.2006.4374401
Y. V Kiran1,*, T. Venkatesh1,*, C. Siva Ram Murthy1,*
  • 1: Department of Computer Science and Engineering, Indian Institute of Technology Madras 600036 India
*Contact email: vkirany@cse.iitm.ernet.in, tvenku@cse.iitm.ernet.in, murthy@iitm.ac.in

Abstract

Optical burst switching (OBS) is a promising technology that exploits the benefits of optical communication and supports statistical multiplexing of data traffic at a fine granularity making it a suitable technology for the next generation Internet. Development of efficient algorithms for path selection and wavelength selection is crucial in minimizing the burst loss probability (BLP) in OBS networks. In this paper, we present novel Reinforcement Learning algorithms for path selection and wavelength selection in the context of OBS networks. We develop an online path selection algorithm based on Q-learning to minimize the BLP by choosing an optimal path among a set of predetermined routes between every pair of ingress and egress nodes. We also propose a Q-learning algorithm for wavelength selection that selects an optimal wavelength among the available wavelengths in a pre-routed path with an objective of minimizing the BLP. We assume no wavelength conversion and buffering to be available at the core nodes of the OBS network. We simulate the proposed algorithms under dynamic load to demonstrate that they reduce the BLP compared to the best known adaptive techniques for path selection and wavelength selection available in the literature.