Nature of Computation and Communication. Second International Conference, ICTCC 2016, Rach Gia, Vietnam, March 17-18, 2016, Revised Selected Papers

Research Article

Some Efficient Segmentation-Based Techniques to Improve Time Series Discord Discovery

  • @INPROCEEDINGS{10.1007/978-3-319-46909-6_17,
        author={Huynh Thuy and Duong Anh and Vo Chau},
        title={Some Efficient Segmentation-Based Techniques to Improve Time Series Discord Discovery},
        booktitle={Nature of Computation and Communication. Second International Conference, ICTCC 2016, Rach Gia, Vietnam, March 17-18, 2016, Revised Selected Papers},
        proceedings_a={ICTCC},
        year={2017},
        month={1},
        keywords={Time series, Discord discovery, HOT SAX, Segmentation},
        doi={10.1007/978-3-319-46909-6_17}
    }
    
  • Huynh Thuy
    Duong Anh
    Vo Chau
    Year: 2017
    Some Efficient Segmentation-Based Techniques to Improve Time Series Discord Discovery
    ICTCC
    Springer
    DOI: 10.1007/978-3-319-46909-6_17
Huynh Thuy¹,*, Duong Anh¹,*, Vo Chau¹,*
  • 1: Ho Chi Minh City University of Technology
*Contact email: huynhthithuthuy@tdt.edu.vn, dtanh@cse.hcmut.edu.vn, chauvtn@cse.hcmut.edu.vn

Abstract

Time series discords have proved to be a useful concept for time series anomaly detection, and various algorithms have been developed to search for them. HOT SAX is a well-known and effective algorithm for time series discord discovery. However, it still has some weaknesses. First, users of HOT SAX must choose suitable values for the discord length, word length, and/or alphabet size, which are not known in advance. Second, HOT SAX still suffers from high computational cost. In this paper, we propose some novel techniques to improve the HOT SAX algorithm. These techniques consist of (i) using time series segmentation methods to estimate two important parameters, the discord length and the word length, and (ii) speeding up the discord discovery process with a new way of shifting the sliding window. Extensive experiments demonstrate that the proposed approach not only makes it easier for users to set the parameters but also improves discord discovery in terms of accuracy and computational efficiency.
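
For orientation, the discord concept mentioned in the abstract can be illustrated with a brute-force search. The sketch below is neither HOT SAX nor the segmentation-based technique proposed in the paper. It is only the plain quadratic baseline, under two assumptions that are not stated on this page: a discord is taken to be the subsequence whose distance to its nearest non-overlapping neighbor is largest, and distance is Euclidean on z-normalized subsequences. The names `znorm`, `euclidean`, `brute_force_discord`, and the parameter `n` (discord length) are illustrative.

```python
import math


def znorm(seq):
    """Z-normalize a subsequence; a constant subsequence maps to all zeros."""
    mean = sum(seq) / len(seq)
    std = math.sqrt(sum((x - mean) ** 2 for x in seq) / len(seq))
    if std == 0:
        return [0.0] * len(seq)
    return [(x - mean) / std for x in seq]


def euclidean(a, b):
    """Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def brute_force_discord(series, n):
    """Return (start_index, nn_distance) of the top discord of length n.

    Assumption: the discord is the subsequence with the largest distance
    to its nearest neighbor, ignoring overlapping (trivial) matches.
    Returns (-1, -1.0) if no subsequence has a non-overlapping neighbor.
    """
    # Every sliding-window subsequence of length n, z-normalized once.
    subs = [znorm(series[i:i + n]) for i in range(len(series) - n + 1)]
    best_idx, best_dist = -1, -1.0
    for i, a in enumerate(subs):
        nn = math.inf
        for j, b in enumerate(subs):
            if abs(i - j) >= n:  # skip trivial matches that overlap window i
                nn = min(nn, euclidean(a, b))
        if nn != math.inf and nn > best_dist:
            best_idx, best_dist = i, nn
    return best_idx, best_dist
```

Calling `brute_force_discord(series, n)` on a list of floats returns the start position of the most unusual window and its nearest-neighbor distance. The nested loops make the cost quadratic in the number of windows, and `n` must be supplied by the user. These are the two pain points the abstract attributes to discord discovery in general and to HOT SAX in particular.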