INFOCOMP 2015, The Fifth International Conference on Advanced Communications and Computation
Impact of the Entering Time on Collective Performance
Authors:
Christoph Niethammer
Dmitry Khabi
Huan Zhou
Vladimir Marjanovic
José Gracia
Keywords: collectives; late-arrivals; benchmarking; MPI
Abstract:
Collective operations strongly affect the performance of many MPI applications, as they involve large numbers of processes, and frequently all of them, communicating with each other. One critical issue for the performance of collective operations is load imbalance, which causes processes to enter collective operations at different times. The influence of such late arrivals is not yet well understood. Earlier work showed that even small system noise can have a tremendous effect on collective performance. Thus, although algorithms are optimized for large process counts, they do not seem to tolerate noise or account for delays of the involved processes, and even a small perturbation from a single process can have a negative effect on the overall collective execution. In this work, we present a first detailed study of the effect of late arrivals on collective performance in MPI. For the evaluation, a new, specialized benchmark was designed and a new metric, which we call delay overlap benefit, was used. Our results show that there is already some potential tolerance to late arrivals, but there is also a lot of room for future optimizations.
Pages: 60 to 65
Copyright: Copyright (c) IARIA, 2015
Publication date: June 21, 2015
Published in: conference
ISSN: 2308-3484
ISBN: 978-1-61208-416-9
Location: Brussels, Belgium
Dates: from June 21, 2015 to June 26, 2015