Reinforcement Learning for Reliability Optimisation

Saka, Prasuna; Banerjee, Ansuman

Home // ICSEA 2018, The Thirteenth International Conference on Software Engineering Advances // View article

Reinforcement Learning for Reliability Optimisation

Authors:
Prasuna Saka
Ansuman Banerjee

Keywords: Reliability Optimisation; Reinforcement Learning; Multi-armed Bandit.

Abstract:
Software Reliability Optimization problem is aimed at bridging the reliability gap in an optimal way. In an industrial setting, focussed testing at the component level is the most favored solution exercised to fill the reliability gap. However, with the increased complexity in the software systems coupled with limited testing timing constraints finding an optimal set of test suite to bridge the reliability gap has become an area of intense research. Furthermore, the stochastic nature of the reliability improvement estimates associated with each test suite manifolds the complexity. Here, we propose Reinforcement Learning (RL),as a mechanism to find an optimal solution. We have shown how an interactive learning is used to estimate the true outcome of the selection and the action selection policy so as to maximise the long term reward. The estimation methodology and the selection policy is inspired by Multi-armed bandit solution strategies. Firstly, we employ a sample average estimation technique for deriving the true outcomes. Secondly, a variant of simple greedy algorithm coined as epsilon-greedy algorithm is considered for action selection policy. These two steps are iteratively exercised until the selection criteria converges. The efficacy of the proposed approach is illustrated on a real time case study.

Pages: 25 to 32

Copyright: Copyright (c) IARIA, 2018

Publication date: October 14, 2018

Published in: conference

ISSN: 2308-4235

ISBN: 978-1-61208-668-2

Location: Nice, France

Dates: from October 14, 2018 to October 18, 2018