Home // International Journal On Advances in Software, volume 11, numbers 1 and 2, 2018 // View article
Authors:
Zoltán Szabó
Vilmos Bilicki
Árpád Berta
Zoltán Richárd Jánki
Keywords: smartphones; data cleaning; Peer-to-Peer; crowdsourcing
Abstract:
The increasing popularity of smartphones makes them popular tools for various big data collecting crowdsourcing campaigns, but there are still many open questions about the proper methodology of these campaigns. Beyond this, despite the growing popularity of this type of research, there are familiar difficulties and challenges in handling a wide range of uploads, maintaining the quality of the datasets, cleaning the datasets containing noisy, incorrect data, motivating the participants, and providing support for data collection regardless of the remoteness of the device. In order to collect information about the Network Address Translation (NAT)-related environment and the Peer-to-Peer (P2P) networking capabilities of mobile phones, we utilized a crowdsourcing approach. We collected more than 70 million data records from over 100 countries measuring the NAT characteristics of more than 1,300 carriers and over 35,000 WiFi environments during the three-year project. Since then, we have also expanded and released our application to collect even more data concerning the Peer-to-Peer capabilities. Here, we introduce our data collecting and Peer-to-Peer architectures, some of the most prominent problems we have encountered since its launch, some of the solutions and proposed solutions to handle difficulties.
Pages: 120 to 130
Copyright: Copyright (c) to authors, 2018. Used with permission.
Publication date: June 30, 2018
Published in: journal
ISSN: 1942-2628