Research Article
Some Efficient Segmentation-Based Techniques to Improve Time Series Discord Discovery
@INPROCEEDINGS{10.1007/978-3-319-46909-6_17, author={Huynh Thuy and Duong Anh and Vo Chau}, title={Some Efficient Segmentation-Based Techniques to Improve Time Series Discord Discovery}, proceedings={Nature of Computation and Communication. Second International Conference, ICTCC 2016, Rach Gia, Vietnam, March 17-18, 2016, Revised Selected Papers}, proceedings_a={ICTCC}, year={2017}, month={1}, keywords={Time series Discord discovery HOT SAX Segmentation}, doi={10.1007/978-3-319-46909-6_17} }
- Huynh Thuy
Duong Anh
Vo Chau
Year: 2017
Some Efficient Segmentation-Based Techniques to Improve Time Series Discord Discovery
ICTCC
Springer
DOI: 10.1007/978-3-319-46909-6_17
Abstract
Time series discord has proved to be a useful concept for time series anomaly detection. To search for discords, various algorithms have been developed. HOT SAX has been considered as a well-known and effective algorithm in time series discord discovery. However this algorithm still has some weaknesses. First, users of HOT SAX are required to choose suitable values for the discord length, word-length and/or alphabet-size, which are unknown. Second, HOT SAX still suffers from high computation cost. In this paper, we propose some novel techniques to improve HOT SAX algorithm. These techniques consist of (i) using some time series segmentation methods to estimate the two important parameters: discord length and word length and (ii) speeding up the discord discovery process by a new way of shifting the sliding window. Extensive experiments have demonstrated that the proposed approach can not only facilitate users in setting the parameters, but also improve the discord discovery in terms of accuracy and computational efficiency.