Home // INFOCOMP 2021, The Eleventh International Conference on Advanced Communications and Computation // View article


An API to Include HPC Resources in Workflow Systems

Authors:
Sven Bingert
Christian Köhler
Hendrik Nolte
Waqar Alamgir

Keywords: HPC, automation, RESTful API, workflow en- gine, data management, provenance, data lake

Abstract:
The demand for processing power by modern data analyses is continuously increasing. High-Performance- Computing (HPC) resources can help but the standard process is for users to log in to use the HPC systems which is often complicated and not well suited for the integration in workflows. In order to bridge the gap between external workflow tools and the usage of HPC resources, we designed and implemented an application interface. This API allows workflow systems to submit HPC jobs along with required artefacts to the queuing system without a direct login of the user. The presented API regards the required safety regulations by ensuring the identity of authorised external workflow systems, as well as the executing HPC systems with a token-based authentication model. In this paper we describe the design of the API and present three use-cases. In the data lake use-case, a novel technique for provenance auditing without runtime overhead is presented which is particularly well suited for HPC systems.

Pages: 15 to 20

Copyright: Copyright (c) IARIA, 2021

Publication date: May 30, 2021

Published in: conference

ISSN: 2308-3484

ISBN: 978-1-61208-865-5

Location: Valencia, Spain

Dates: from May 30, 2021 to June 3, 2021