Home // ICIW 2012, The Seventh International Conference on Internet and Web Applications and Services // View article
Characterization of the Wikipedia Traffic
Authors:
Antonio J. Reinoso
Rocío Muñoz-Mansilla
Israel Herraiz
Felipe Ortega
Keywords: Wikipedia; Traffic characterization
Abstract:
Since its inception, Wikipedia has grown to a solid and stable project and turned into a mass collaboration tool that allows the sharing and distribution of knowledge. The wiki approach that basis this initiative promotes the participation and collaboration of users. In addition to visits for browsing its contents, Wikipedia also receives the contributions of users to improve them. In the past, researchers paid attention to different aspects concerning authoring and quality of contents. However, little effort has been made to study the nature of the visits that Wikipedia receives. We conduct such an study using a sample of users' requests provided by the Wikimedia Foundation in the form of Squid log lines. Our sample contains more that 14,000 million requests from users all around the world and directed to all the projects maintained by the Wikimedia Foundation, including different editions of Wikipedia. This papers describes the work made to characterize the traffic directed to Wikipedia and consisting of the requests sent by its users. Our main aim is to obtain a detailed description of its composition in terms of the percentages corresponding to the different types of requests making part of it. The benefits from our work may range from the prediction of traffic peaks to the determination of the kind of resources most often requested, which can be useful for scalability considerations.
Pages: 156 to 162
Copyright: Copyright (c) IARIA, 2012
Publication date: May 27, 2012
Published in: conference
ISSN: 2308-3972
ISBN: 978-1-61208-200-4
Location: Stuttgart, Germany
Dates: from May 27, 2012 to June 1, 2012