
Research Article
A Stable Fine-Grained Webpage Fingerprinting: Aiming at the Unstable Realistic Network
@INPROCEEDINGS{10.1007/978-3-031-64954-7_9, author={Songtao Liu and Hua Wu and Hao Luo and Guang Cheng and Xiaoyan Hu}, title={A Stable Fine-Grained Webpage Fingerprinting: Aiming at the Unstable Realistic Network}, proceedings={Security and Privacy in Communication Networks. 19th EAI International Conference, SecureComm 2023, Hong Kong, China, October 19-21, 2023, Proceedings, Part II}, proceedings_a={SECURECOMM PART 2}, year={2024}, month={10}, keywords={Fine-grained webpage fingerprinting Encrypted traffic classification Machine learning Realistic network}, doi={10.1007/978-3-031-64954-7_9} }
- Songtao Liu
Hua Wu
Hao Luo
Guang Cheng
Xiaoyan Hu
Year: 2024
A Stable Fine-Grained Webpage Fingerprinting: Aiming at the Unstable Realistic Network
SECURECOMM PART 2
Springer
DOI: 10.1007/978-3-031-64954-7_9
Abstract
Website fingerprinting enables attackers to snoop on users’ browsing preferences for websites, even if the network connections are encrypted. Fine-grained webpage fingerprinting can reveal more sensitive privacy by precisely distinguishing pages from the same website. Recognizing similar webpages within the same website needs representative fine-grained features. However, the fluctuating realistic network makes it challenging to ensure the stability of fine-grained features. In this paper, we propose a fine-grained webpage fingerprinting method named Stable Webpage Fingerprinting (StableWPF) to obtain stable webpage features that are not affected by the unstable realistic network. We use the frequency information of the length of the TLS fragments to depict the most representative features. To eliminate the fluctuation of the features, we leverage the kernel density estimation and the Bag-of-Words model in our method. The experimental results in the closed-world and open-world scenarios show that our method outperforms the state-of-the-art approaches in accuracy, precision, and recall. The robust performance obtained on famous real-world websites with various network environments demonstrates the generalization ability of our method.