Using Cloud-based Resources to Improve Availability and Reliability in a Scientific Workflow Execution Framework

Hernández, Sergio; Fabra, Javier; Álvarez, Pedro; Ezpeleta, Joaquín

Home // CLOUD COMPUTING 2013, The Fourth International Conference on Cloud Computing, GRIDs, and Virtualization // View article

Using Cloud-based Resources to Improve Availability and Reliability in a Scientific Workflow Execution Framework

Authors:
Sergio Hernández
Javier Fabra
Pedro Álvarez
Joaquín Ezpeleta

Keywords: Fault tolerance; Scalability; Cloud computing; Heterogeneous computing infrastructures; Resource management frameworks.

Abstract:
Different mechanisms, such as checkpointing, task replication, alternative tasks execution or task migration among different resources, for instance, have been traditionally applied in (heterogeneous) grid environments for fault-tolerance. Cloud based resources can easily improve both availability and reliability of a given system when used for recovering faulty tasks. In this paper we present how cloud resources have been included in a framework for the execution of scientific workflows and how this has helped in improving the framework in two different aspects: making it more scalable and more reliable, facilitating the application of very effective fault recovery policies.

Pages: 230 to 237

Copyright: Copyright (c) IARIA, 2013

Publication date: May 27, 2013

Published in: conference

ISSN: 2308-4294

ISBN: 978-1-61208-271-4

Location: Valencia, Spain

Dates: from May 27, 2013 to June 1, 2013