Machine Learning and Intelligent Communications. 4th International Conference, MLICOM 2019, Nanjing, China, August 24–25, 2019, Proceedings

Research Article

High-Dimensional Data Anomaly Detection Framework Based on Feature Extraction of Elastic Network

Download
97 downloads
  • @INPROCEEDINGS{10.1007/978-3-030-32388-2_1,
        author={Yang Shen and Jue Bo and KeXin Li and Shuo Chen and Lin Qiao and Jing Li},
        title={High-Dimensional Data Anomaly Detection Framework Based on Feature Extraction of Elastic Network},
        proceedings={Machine Learning and Intelligent Communications. 4th International Conference, MLICOM 2019, Nanjing, China, August 24--25, 2019, Proceedings},
        proceedings_a={MLICOM},
        year={2019},
        month={10},
        keywords={Elastic network Feature extraction Anomaly detection High-dimensional data Data mining},
        doi={10.1007/978-3-030-32388-2_1}
    }
    
  • Yang Shen
    Jue Bo
    KeXin Li
    Shuo Chen
    Lin Qiao
    Jing Li
    Year: 2019
    High-Dimensional Data Anomaly Detection Framework Based on Feature Extraction of Elastic Network
    MLICOM
    Springer
    DOI: 10.1007/978-3-030-32388-2_1
Yang Shen1, Jue Bo1, KeXin Li2,*, Shuo Chen1, Lin Qiao1, Jing Li2
  • 1: State Grid Liaoning Electric Power Supply Co., Ltd.
  • 2: Nanjing University of Aeronautics and Astronautics
*Contact email: kexinli@nuaa.edu.cn

Abstract

Although appropriate feature extraction can improve the performance of anomaly detection, it is a challenging task due to the complex interaction between features, the mixture of irrelevant features and relevant features, and the unavailability of data tags. When conventional anomaly detection methods deal with the problem of anomaly detection of high dimensional data, the performance of anomaly detection will be degraded due to the existence of irrelevant features. This paper proposed a method of feature extraction and anomaly detection for high dimensional data based on elastic network, which can filter irrelevant features and improve the accuracy and efficiency of anomaly detection. In this paper, an outlier scoring method was used to score the outliers of the original data, and then outliers and the original data were input into the elastic network for sparse regression. After feature extraction of elastic network, those irrelevant features to abnormal data are ignored, thus reducing the dimension of data. Finally, high-dimensional data are detected efficiently according to extracted features. In the experimental stage, we used the high-dimensional anomaly dataset provided by ODDS to detect the performance of the proposed method based on AUC detection accuracy, ROC curve, feature number, convergence speed and other indicators. The results show that the proposed method not only can effectively extract the features related to high-dimensional anomaly data, but also the detection accuracy of outliers has been greatly improved.