5th International ICST Conference on Heterogeneous Networking for Quality, Reliability, Security and Robustness

Research Article

Measuring Web Feature Impacts in BitTorrent-like Systems

Download548 downloads
  • @INPROCEEDINGS{10.4108/ICST.QSHINE2008.3906,
        author={Sirui Yang and Hai Jin and Bo Li and Xiaofei Liao and Hong Yao and Qi Huang},
        title={Measuring Web Feature Impacts in BitTorrent-like Systems},
        proceedings={5th International ICST Conference on Heterogeneous Networking for Quality, Reliability, Security and Robustness},
        publisher={ICST},
        proceedings_a={QSHINE},
        year={2010},
        month={5},
        keywords={Peer-to-Peer File sharing Measurement Zipf Highlight effect},
        doi={10.4108/ICST.QSHINE2008.3906}
    }
    
  • Sirui Yang
    Hai Jin
    Bo Li
    Xiaofei Liao
    Hong Yao
    Qi Huang
    Year: 2010
    Measuring Web Feature Impacts in BitTorrent-like Systems
    QSHINE
    ICST
    DOI: 10.4108/ICST.QSHINE2008.3906
Sirui Yang1,*, Hai Jin1,*, Bo Li2,*, Xiaofei Liao1,*, Hong Yao1,*, Qi Huang1,*
  • 1: Service Computing Technology and System Lab, Cluster and Grid Computing Lab, School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, China
  • 2: Department of Computer Science, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
*Contact email: hjin@mail.hust.edu.cn, hjin@mail.hust.edu.cn, bli@ust.hk, hjin@mail.hust.edu.cn, hjin@mail.hust.edu.cn, hjin@mail.hust.edu.cn

Abstract

In Peer-to-Peer (P2P) file sharing systems, the attributes of resource description can influence the user behavior, especially on resource selection. However, this has been only qualitatively speculated but lacks of quantitative analysis. In this paper, we carry out a systematically quantitative study on the impact of these attributes presented in the form of web features, by measuring the largest BitTorrent website in CERNET. The measurement lasts for 31 days, and there are 168,610 records containing 11,228 distinct resources collected. The result is two-fold. On one hand, it confirms the above qualitative speculation; on the other hand, it shows more significant findings: (1) with the highlight feature on popular items, the downloads of each resource yield to a long-tail distribution however deviating from Zipf Law; (2) publications with attracting titles disseminate 1.9 times faster than others; (3) publisher authority feature does not evidently help the system escaping from malicious resources' pervasion; (4) other features such as taxonomy and size also influence users' choice. We further demonstrate the implications of the web feature impact for system designers and potential attackers.