Security and Privacy in Communication Networks. 17th EAI International Conference, SecureComm 2021, Virtual Event, September 6–9, 2021, Proceedings, Part I

Research Article

TMT-RF: Tunnel Mixed Traffic Classification Based on Random Forest

  • @INPROCEEDINGS{10.1007/978-3-030-90019-9_21,
        author={Panpan Zhao and Gaopeng Gou and Chang Liu and Yangyang Guan and Mingxin Cui and Gang Xiong},
        title={TMT-RF: Tunnel Mixed Traffic Classification Based on Random Forest},
        proceedings={Security and Privacy in Communication Networks. 17th EAI International Conference, SecureComm 2021, Virtual Event, September 6--9, 2021, Proceedings, Part I},
        proceedings_a={SECURECOMM},
        year={2021},
        month={11},
        keywords={Traffic classification IPSec tunnel Machine learning},
        doi={10.1007/978-3-030-90019-9_21}
    }
    
  • Panpan Zhao
    Gaopeng Gou
    Chang Liu
    Yangyang Guan
    Mingxin Cui
    Gang Xiong
    Year: 2021
    TMT-RF: Tunnel Mixed Traffic Classification Based on Random Forest
    SECURECOMM
    Springer
    DOI: 10.1007/978-3-030-90019-9_21
Panpan Zhao1, Gaopeng Gou1, Chang Liu1, Yangyang Guan1, Mingxin Cui1, Gang Xiong1
  • 1: Chinese Academy of Sciences

Abstract

With the explosive growth of the use of tunnels, network anomaly detection and security management are facing huge challenges, of which the first and an important step is tunnel traffic classification. Previous research on the classification of encrypted traffic is mainly based on machine learning methods using statistical features and deep learning methods using packet arrival time and packet length sequence. However, these works mainly focus on the identification of single application traffic. In a real scenario where a single user uses a tunnel, the traffic within a time may contain multiple applications. Due to the tunnel traffic has the same five-tuple, we can’t get the start and end times of each application. Compared with encrypted application traffic classification, it is more difficult to identify applications in tunnels. In this paper, firstly we propose a TMT-RF framework to identify two mixed applications in IPSec tunnels. Then we introduce the first use of NoiseSplit module to split the traffic and then use a CombineBurst module for the second split. Finally, we collected four mixed traffic data sets of three types to evaluate our proposed method. Experimental results demonstrate that TMT-RF not only achieves a splitting accuracy of 93 in positive-time separation applications, but also outperforms other state-of-the-art methods on the data sets for zero-time separation applications and negative-time separation applications.