Sampling Bias in BitTorrent Measurements

Boxun Zhang, Alexandru Iosup, Johan A. Pouwelse, Dick H.J. Epema, Henk J. Sips

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Real-world measurements play an important role in understanding the characteristics and in improving the operation of BitTorrent, which is currently a popular Internet application. Much like measuring the Internet, the complexity and scale of the BitTorrent network make a single, complete measurement impractical. While a large number of measurements have already employed diverse sampling techniques to study parts of BitTorrent network, until now there exists no investigation of their sampling bias, that is, of their ability to objectively represent the characteristics of BitTorrent. In this work we present the first study of the sampling bias in BitTorrent measurements. We first introduce a novel taxonomy of sources of sampling bias in BitTorrent measurements. We then investigate the sampling among fifteen long-term BitTorrent measurements completed between 2004 and 2009, and find that different data sources and measurement techniques can lead to significantly different measurement results. Last, we formulate three recommendations to improve the design of future BitTorrent measurements, and estimate the cost of using these recommendations in practice. © 2010 Springer-Verlag.
Original languageEnglish
Title of host publicationEuro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31 - September 3, 2010, Proceedings, Part I
Pages484-496
Number of pages13
Volume6271 LNCS
DOIs
Publication statusPublished - 2010
Externally publishedYes
Event16th International Euro-Par Conference on Parallel Processing, Euro-Par 2010 - Ischia, Italy
Duration: 31 Aug 20103 Sept 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume6271 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th International Euro-Par Conference on Parallel Processing, Euro-Par 2010
Country/TerritoryItaly
CityIschia
Period31/08/103/09/10

Fingerprint

Dive into the research topics of 'Sampling Bias in BitTorrent Measurements'. Together they form a unique fingerprint.

Cite this