5th International ICST Conference on Collaborative Computing: Networking, Applications, Worksharing

Research Article

A decentralized mechanism for application level monitoring of distributed systems

  • @INPROCEEDINGS{10.4108/ICST.COLLABORATECOM2009.8336,
        author={Constantin Serban and Wenxuan Zhang and Naftaly Minsky},
        title={A decentralized mechanism for application level monitoring of distributed systems},
        proceedings={5th International ICST Conference on Collaborative Computing: Networking, Applications, Worksharing},
        proceedings_a={COLLABORATECOM},
        year={2009},
        month={12},
        keywords={collaborative monitoring self management LGI},
        doi={10.4108/ICST.COLLABORATECOM2009.8336}
    }
    
  • Constantin Serban
    Wenxuan Zhang
    Naftaly Minsky
    Year: 2009
    A decentralized mechanism for application level monitoring of distributed systems
    COLLABORATECOM
    ICST
    DOI: 10.4108/ICST.COLLABORATECOM2009.8336
Constantin Serban1,*, Wenxuan Zhang2,*, Naftaly Minsky2,*
  • 1: Applied Research, Telcordia Technologies, One Telcordia Drive, Piscataway, NJ 08854
  • 2: Department of Computer Science, Rutgers University, 110 Frelinghuysen Rd., Piscataway, NJ 08854
*Contact email: serban@research.telcordia.com, wzhang@cs.rutgers.edu, minsky@cs.rutgers.edu

Abstract

For a complex distributed system to be dependable, it must be continuously monitored so that its failures and imperfections can be discovered and corrected in a timely manner. This work is concerned with monitoring large, open, and heterogeneous systems at the application level. Our objective is a monitoring technique that satisfies the following properties: scalability with respect to both the size of the system and the complexity of the monitoring task; the ability to deal reliably with heterogeneous components; and ease and flexibility of deployment. Our approach to monitoring is based on a middleware called law-governed interaction (LGI), which is a decentralized coordination and control mechanism.