1st International ICST Conference on Communications and Networking in China

Research Article

An Algorithm of Large-Scale Approximate Multiple String Matching for Network Security

  • @INPROCEEDINGS{10.1109/CHINACOM.2006.344838,
        author={Tian  Song and Yibo Xue and Dongsheng  Wang},
        title={An Algorithm of Large-Scale Approximate Multiple String Matching for Network Security},
        proceedings={1st International ICST Conference on Communications and Networking in China},
        publisher={IEEE},
        proceedings_a={CHINACOM},
        year={2007},
        month={4},
        keywords={},
        doi={10.1109/CHINACOM.2006.344838}
    }
    
  • Tian Song
    Yibo Xue
    Dongsheng Wang
    Year: 2007
    An Algorithm of Large-Scale Approximate Multiple String Matching for Network Security
    CHINACOM
    IEEE
    DOI: 10.1109/CHINACOM.2006.344838
Tian Song1,2,3, Yibo Xue2,3, Dongsheng Wang2,3
  • 1: Department of Computer Science and Technology,
  • 2: Tsinghua National Laboratory for Information Science and Technology,
  • 3: Tsinghua University, China

Abstract

Payload checking has become the basic technique for network security applications, where the exact string matching technology is widely used. But as the game between attackers and defenders goes further into payload confusion, the approximate string matching technology is needed, especially large-scale approximate multiple string matching technology. In this paper, we propose one practical algorithm, LargePEX, for large scale approximate multiple string matching based on edit distance. The algorithm is basically extended from PEX, an algorithm of approximate single string matching, with the idea of filtering and verification. LargePEX is finely designed to fit for large-scale matching using fine grain steps analyses. Some experiments are presented to verify the efficiency of LargePEX. As the results show, for the set of 10k strings, the average network payload checking speed using this algorithm can achieve 25 MBps-40 MBps, enough for 100 Mbps Ethernet. With hardware upgrading, the algorithm is also practical for Gigabit Ethernet. So LargePEX provides a new way for defenders to develop more effective methods to protect valuable resources and prevent intrusions by payload checking.