Networks for Grid Applications. Third International ICST Conference, GridNets 2009, Athens, Greece, September 8-9, 2009, Revised Selected Papers

Research Article

An Alarms Service for Monitoring Multi-domain Grid Networks

Download
595 downloads
  • @INPROCEEDINGS{10.1007/978-3-642-11733-6_8,
        author={Charaka Palansuriya and Jeremy Nowell and Florian Scharinger and Kostas Kavoussanakis and Arthur Trew},
        title={An Alarms Service for Monitoring Multi-domain Grid Networks},
        proceedings={Networks for Grid Applications. Third International ICST Conference, GridNets 2009, Athens, Greece, September 8-9, 2009, Revised Selected Papers},
        proceedings_a={GRIDNETS},
        year={2012},
        month={6},
        keywords={Alarms multi-domain networks Grid federated networks network monitoring Network Monitoring Working Group (NM-WG)},
        doi={10.1007/978-3-642-11733-6_8}
    }
    
  • Charaka Palansuriya
    Jeremy Nowell
    Florian Scharinger
    Kostas Kavoussanakis
    Arthur Trew
    Year: 2012
    An Alarms Service for Monitoring Multi-domain Grid Networks
    GRIDNETS
    Springer
    DOI: 10.1007/978-3-642-11733-6_8
Charaka Palansuriya1,*, Jeremy Nowell1,*, Florian Scharinger1,*, Kostas Kavoussanakis1,*, Arthur Trew1,*
  • 1: The University of Edinburgh
*Contact email: charaka@epcc.ed.ac.uk, jeremy@epcc.ed.ac.uk, florian@epcc.ed.ac.uk, kavousan@epcc.ed.ac.uk, arthur@epcc.ed.ac.uk

Abstract

Effective monitoring of multi-domain Grid networks is essential to support large operational Grid infrastructures. Timely detection of network problems is an essential part of this monitoring. In order to detect the problems, access to network monitoring data that exists in multiple organisations is necessary. This paper presents an Alarms Service that supports monitoring of such multi-domain Grid networks. The service allows timely detection of networking problems based on pre-defined, user-configurable conditions. The requirements gathered from real users for monitoring the networks are discussed. The paper shows how multi-organisation data access is resolved with the use of a standards-based access mechanism. The architecture of the Alarms Service is discussed, providing the reasons behind the design decisions where appropriate. A description of the current implementation of the Alarms Service and its deployment is provided.