Research Article
Interpolating the Missing Values for Multi-Dimensional Spatial-Temporal Sensor Data: A Tensor SVD Approach
@INPROCEEDINGS{10.4108/eai.7-11-2017.2274958, author={Peipei Xu and Wenjie Ruan and Quan Z. Sheng and Tao Gu and Lina Yao}, title={Interpolating the Missing Values for Multi-Dimensional Spatial-Temporal Sensor Data: A Tensor SVD Approach}, proceedings={14th EAI International Conference on Mobile and Ubiquitous Systems: Computing, Networking and Services}, publisher={ACM}, proceedings_a={MOBIQUITOUS}, year={2018}, month={4}, keywords={sensor data recovery tensor completion t-svd admm}, doi={10.4108/eai.7-11-2017.2274958} }
- Peipei Xu
Wenjie Ruan
Quan Z. Sheng
Tao Gu
Lina Yao
Year: 2018
Interpolating the Missing Values for Multi-Dimensional Spatial-Temporal Sensor Data: A Tensor SVD Approach
MOBIQUITOUS
ACM
DOI: 10.4108/eai.7-11-2017.2274958
Abstract
With the booming of the Internet of Things, enormous number of smart devices/sensors have been deployed in the physical world to monitor our surroundings. Usually those devices generate high- dimensional geo-tagged time-series data. However, these sensor readings are easily missing due to the hardware malfunction, con- nection errors or data corruption, which severely compromise the back-end data analysis. To solve this problem, in this paper we ex- ploit tensor-based Singular Value Decomposition method to recover the missing sensor readings. The main novelty of this paper lies in that, i) our tensor-based recovery method can well capture the multi-dimensional spatial and temporal features by transforming the irregularly deployed sensors into a sensor-array and folding the periodic temporal patterns into multiple time dimensions, ii) it only requires to tune one key parameter in an unsupervised manner, and iii) Tensor Singular Value Decomposition structure is more efficient on representation of high-dimension sensor data than other tensor recovery methods based on tensor’s vectorization or flattening. The experimental results in several real-world one-year air quality and meteorology datasets demonstrate the effectiveness and accuracy of our approach.