TY - JOUR
T1 - Wikipedia Workload Analysis for Decentralized Hosting
AU - Urdaneta Paredes, G.A.
AU - Pierre, G.E.O.
AU - van Steen, M.R.
PY - 2009
Y1 - 2009
N2 - We study an access trace containing a sample of Wikipedia's traffic over a 107-day period aiming to identify appropriate replication and distribution strategies in a fully decentralized hosting environment. We perform a global analysis of the whole trace, and a detailed analysis of the requests directed to the English edition of Wikipedia. In our study, we classify client requests and examine aspects such as the number of read and save operations, significant load variations and requests for nonexisting pages. We also review proposed decentralized wiki architectures and discuss how they would handle Wikipedia's workload. We conclude that decentralized architectures must focus on applying techniques to efficiently handle read operations while maintaining consistency and dealing with typical issues on decentralized systems such as churn, unbalanced loads and malicious participating nodes. © 2009 Elsevier B.V. All rights reserved.
AB - We study an access trace containing a sample of Wikipedia's traffic over a 107-day period aiming to identify appropriate replication and distribution strategies in a fully decentralized hosting environment. We perform a global analysis of the whole trace, and a detailed analysis of the requests directed to the English edition of Wikipedia. In our study, we classify client requests and examine aspects such as the number of read and save operations, significant load variations and requests for nonexisting pages. We also review proposed decentralized wiki architectures and discuss how they would handle Wikipedia's workload. We conclude that decentralized architectures must focus on applying techniques to efficiently handle read operations while maintaining consistency and dealing with typical issues on decentralized systems such as churn, unbalanced loads and malicious participating nodes. © 2009 Elsevier B.V. All rights reserved.
U2 - 10.1016/j.comnet.2009.02.019
DO - 10.1016/j.comnet.2009.02.019
M3 - Article
SN - 1389-1286
VL - 53
SP - 1830
EP - 1845
JO - Computer Networks (1999)
JF - Computer Networks (1999)
IS - 11
ER -