Collaborative Computing: Networking, Applications and Worksharing. 4th International Conference, CollaborateCom 2008, Orlando, FL, USA, November 13-16, 2008, Revised Selected Papers

Research Article

Data Quality and Failures Characterization of Sensing Data in Environmental Applications

Download72 downloads
  • @INPROCEEDINGS{10.1007/978-3-642-03354-4_50,
        author={Kewei Sha and Guoxing Zhan and Safwan Al-Omari and Tim Calappi and Weisong Shi and Carol Miller},
        title={Data Quality and Failures Characterization of Sensing Data in Environmental Applications},
        proceedings={Collaborative Computing: Networking, Applications and Worksharing. 4th International Conference, CollaborateCom 2008, Orlando, FL, USA, November 13-16, 2008, Revised Selected Papers},
        proceedings_a={COLLABORATECOM},
        year={2012},
        month={5},
        keywords={},
        doi={10.1007/978-3-642-03354-4_50}
    }
    
  • Kewei Sha
    Guoxing Zhan
    Safwan Al-Omari
    Tim Calappi
    Weisong Shi
    Carol Miller
    Year: 2012
    Data Quality and Failures Characterization of Sensing Data in Environmental Applications
    COLLABORATECOM
    Springer
    DOI: 10.1007/978-3-642-03354-4_50
Kewei Sha1,*, Guoxing Zhan2,*, Safwan Al-Omari2,*, Tim Calappi2,*, Weisong Shi2,*, Carol Miller2,*
  • 1: Oklahoma City University
  • 2: Wayne State University
*Contact email: ksha@okcu.edu, gxzhan@wayne.edu, somari@wayne.edu, tcalappi@wayne.edu, weisong@wayne.edu, cjmiller@wayne.edu

Abstract

Environmental monitoring, targeting at discovering and understanding the environmental laws and changes, is one of the most important sensor network application domains. Environmental monitoring is one of the most important sensor network application domains. The success of those applications is determined by the quality of the collected data. Thus, it is crucial to carefully analyze the collected sensing data, which not only helps us understand the features of monitored field, but also unveil any limitations and opportunities that should be considered in future sensor system design. In this paper, we take an initial step and analyze one-month sensing data collected from a real-world water system surveillance application, focusing on the data similarity, data abnormality and failure patterns. Our major findings include: (1) Information similarity, including pattern similarity and numerical similarity, is very common, which provides a good opportunity to trade off energy efficiency and data quality; (2) Spatial and multi-modality correlation analysis provide a way to evaluate data integrity and to detect conflicting data that usually indicates appearances of sensor malfunction or interesting events; and (3) External harsh environmental conditions may be the most important factor on inflicting failures in environmental applications. Communication failures, mainly caused by lacking of synchronization, contribute the largest portion among all failure types.