Machine Learning and Intelligent Communications. 4th International Conference, MLICOM 2019, Nanjing, China, August 24–25, 2019, Proceedings

Research Article

An Active Noise Correction Graph Embedding Method Based on Active Learning for Graph Noisy Data

Download
102 downloads
  • @INPROCEEDINGS{10.1007/978-3-030-32388-2_36,
        author={Zhiyuan Cui and Donghai Guan and Cong Li and Weiwei Guan and Asad Khattak},
        title={An Active Noise Correction Graph Embedding Method Based on Active Learning for Graph Noisy Data},
        proceedings={Machine Learning and Intelligent Communications. 4th International Conference, MLICOM 2019, Nanjing, China, August 24--25, 2019, Proceedings},
        proceedings_a={MLICOM},
        year={2019},
        month={10},
        keywords={Graph embedding Noisy label Active correction Active learning},
        doi={10.1007/978-3-030-32388-2_36}
    }
    
  • Zhiyuan Cui
    Donghai Guan
    Cong Li
    Weiwei Guan
    Asad Khattak
    Year: 2019
    An Active Noise Correction Graph Embedding Method Based on Active Learning for Graph Noisy Data
    MLICOM
    Springer
    DOI: 10.1007/978-3-030-32388-2_36
Zhiyuan Cui,*, Donghai Guan,*, Cong Li,*, Weiwei Guan,*, Asad Khattak1,*
  • 1: Zayed University
*Contact email: 565508802@qq.com, dhguan@nuaa.edu.cn, 18851870127@163.com, yuanweiwei@nuaa.edu.cn, Asad.Khattak@zu.ac.ae

Abstract

In various scenarios of the real world, there are various graph data. Most graph structures are confronted with the problems of complex structure and large consumption of memory space. Graph embedding is an effective method to overcome such challenges, which converts graph structure into a low-dimensional dense vector space. In the real world, label acquisition is expensive, and there may be noise in the data. Therefore, it is important to find valuable noise nodes as much as possible to improve the performance of downstream task. In this paper, we propose a novel active sampling strategy for graph noisy data named Active Noise Correction Graph Embedding method (ANCGE). Given the label budget, the proposed method aims to use semi-supervised graph embedding algorithm to find valuable mislabeled nodes. ANCGE measures the value of noise nodes according to their representativeness and influence on the graph. The experimental results on three open datasets demonstrate the effectiveness of our method and its stability under different noise rates.