Home // DATA ANALYTICS 2018, The Seventh International Conference on Data Analytics // View article


Efficient Use of Geographical Information Systems for Improving Transport Mode Classification

Authors:
Jorge Rodriguez-Echeverria
Sidharta Gautama
Nico Van de Weghe
Daniel Ochoa
Benhur Ortiz-Jaramillo

Keywords: Transport mode classification; Crowdsourcing; Tracking data; Receiver operating characteristic

Abstract:
Comparison between transport mode classifiers is usually performed without considering imbalanced samples in the dataset. This problem makes performance rates, such as accuracy and precision, not enough to report the performance of a classifier because they represent a cut-off point in the classifier performance curve. Our rule-based method proposes to combine both, the network elements associated with the transport mode to identify, and the elements associated with other means of transport. We performed a comparison between our proposed method and another GPS/GIS-based method, by applying a real-world representative dataset with a target class imbalance. We evaluated the performance of both methods with five experiments, using the area under the Receiver Operating Characteristic curve as metric. The results show that the tested methods achieve the same false positive rate. However, our method identifies correctly 84% of the true positive samples, i.e., the highest performance in our test data (data collected in Belgium). The proposed method can be used as a part of the post-processing chain in transport data to perform transport and traffic analytics in smart cities.

Pages: 130 to 135

Copyright: Copyright (c) IARIA, 2018

Publication date: November 18, 2018

Published in: conference

ISSN: 2308-4464

ISBN: 978-1-61208-681-1

Location: Athens, Greece

Dates: from November 18, 2018 to November 22, 2018