Research Article
Self-repairing Clusters for Time-Efficient and Scalable Actor-Fault-Tolerance in Wireless Sensor and Actor Networks
@INPROCEEDINGS{10.1007/978-3-642-20931-4_9, author={Loucif Amirouche and Djamel Djenouri and Nadjib Badache}, title={Self-repairing Clusters for Time-Efficient and Scalable Actor-Fault-Tolerance in Wireless Sensor and Actor Networks}, proceedings={Access Networks. 5th International ICST Conference on Access Networks, AccessNets 2010 and First ICST International Workshop on Autonomic Networking and Self-Management in Access Networks, SELFMAGICNETS 2010, Budapest, Hungary, November 3-5, 2010, Revised Selected Papers}, proceedings_a={ACCESSNETS}, year={2012}, month={10}, keywords={}, doi={10.1007/978-3-642-20931-4_9} }
- Loucif Amirouche
Djamel Djenouri
Nadjib Badache
Year: 2012
Self-repairing Clusters for Time-Efficient and Scalable Actor-Fault-Tolerance in Wireless Sensor and Actor Networks
ACCESSNETS
Springer
DOI: 10.1007/978-3-642-20931-4_9
Abstract
A new solution for fault-tolerance in wireless sensor and actor networks (WSAN) is proposed. The solution deals with fault-tolerance of actors, contrary to most of the literature that only considers sensors. It considers real-time communication, and ensures the execution of tasks with low latency despite fault occurrence. A simplified MAMS (multiple-actor multiple-sensor) model is used, where sensed events are duplicated only to a limited number of actors. This is different from the basic MAMS model and semi-passive coordination (SPC), which use data dissemination to all actors for every event. Although it provides high level of fault- tolerance, this large dissemination is costly in terms of power consumption and communication overhead. The proposed solution relies on the construction of self-repairing clusters amongst actors, on which the simplified MAMS is applied. This clustering enables actors to rapidly replace one another whenever some actor breaks down, and eliminates the need of consensus protocol execution upon fault detection, as required by the current approaches to decide which actor should replace the faulty node. The extensive simulation study carried out with TOSSIM in different scenarios shows that the proposed protocol reduces the latency of replacing faulty actors compared to current protocols like SPC. The reduction of the overall delay for executing actions reaches 59%, with very close fault-tolerance (action execution success rate). The difference for this metric does not exceed 8% in the worst case. Scenarios of different network sizes confirm the results and demonstrate the protocol’s scalability.