11th EAI International Conference on Mobile Multimedia Communications

Research Article

Profiling the Performance Anomalies of Multi-Source Media Downloading at Scale in the Wild

Download609 downloads
  • @INPROCEEDINGS{10.4108/eai.21-6-2018.2276556,
        author={Xi Chen and Xinlei Yang and Zhenhua Li},
        title={Profiling the Performance Anomalies of Multi-Source Media Downloading at Scale in the Wild},
        proceedings={11th EAI International Conference on Mobile Multimedia Communications},
        publisher={EAI},
        proceedings_a={MOBIMEDIA},
        year={2018},
        month={9},
        keywords={multi-source media downloading network performance measurement},
        doi={10.4108/eai.21-6-2018.2276556}
    }
    
  • Xi Chen
    Xinlei Yang
    Zhenhua Li
    Year: 2018
    Profiling the Performance Anomalies of Multi-Source Media Downloading at Scale in the Wild
    MOBIMEDIA
    EAI
    DOI: 10.4108/eai.21-6-2018.2276556
Xi Chen1,*, Xinlei Yang1, Zhenhua Li1
  • 1: Tsinghua University
*Contact email: chenxi198909@gmail.com

Abstract

As one of the most fundamental and pervasive Internet services, me- dia file downloading has undergone several generations of enabling technologies. Unfortunately, the performance today is still far from satisfactory. As the state-of- the-art approach to accelerating media file downloads, multi-source downloading enables user clients to utilize multiple data sources and various content deliv- ery techniques. However, without careful designs, multi-source downloading can result in worse performance with higher overhead, referred to as an anomaly. This paper conducts the first empirical study to quantitatively understand the per- formance anomalies of multi-source media downloading, based on the production logs of a large-scale system serving 179M media file downloads for 37M users (including both PC and mobile users) per day. We reveal the characteristics and root causes of manifold anomalies with regard to seven types of data sources. In particular, 23% of the downloads accelerated by using multiple data sources become slower than the original single-source downloading, and there are sweet spots between the number of data sources used and the download speed. Also, we exploit some unconventional metrics (e.g., diversity of participation time) to explain some counter-intuitive anomalies. Accordingly, we provide a series of practical and applicable implications to effectively address the anomalies.