Home // CLOUD COMPUTING 2013, The Fourth International Conference on Cloud Computing, GRIDs, and Virtualization // View article
Authors:
Sergio Hernández
Javier Fabra
Pedro Álvarez
Joaquín Ezpeleta
Keywords: Fault tolerance; Scalability; Cloud computing; Heterogeneous computing infrastructures; Resource management frameworks.
Abstract:
Different mechanisms, such as checkpointing, task replication, alternative tasks execution or task migration among different resources, for instance, have been traditionally applied in (heterogeneous) grid environments for fault-tolerance. Cloud based resources can easily improve both availability and reliability of a given system when used for recovering faulty tasks. In this paper we present how cloud resources have been included in a framework for the execution of scientific workflows and how this has helped in improving the framework in two different aspects: making it more scalable and more reliable, facilitating the application of very effective fault recovery policies.
Pages: 230 to 237
Copyright: Copyright (c) IARIA, 2013
Publication date: May 27, 2013
Published in: conference
ISSN: 2308-4294
ISBN: 978-1-61208-271-4
Location: Valencia, Spain
Dates: from May 27, 2013 to June 1, 2013