Reinforcement Learning for Reliability Optimisation

Cited by: 0
Authors
Saka, Prasuna [1]
Banerjee, Ansuman [2]
Affiliations
[1] DRDO, Adv Syst Lab, New Delhi, India
[2] Indian Stat Inst, Kolkata, India
Source
THIRTEENTH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING ADVANCES (ICSEA 2018) | 2018
Keywords
Reliability Optimisation; Reinforcement Learning; Multi-armed bandit; SOFTWARE-RELIABILITY; MODELS;
DOI
Not available
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
The software reliability optimisation problem aims to bridge the reliability gap in an optimal way. In an industrial setting, focussed testing at the component level is the most favoured way to close this gap. However, with the growing complexity of software systems and limited testing time, finding an optimal set of test suites to bridge the reliability gap has become an area of intense research. Furthermore, the stochastic nature of the reliability improvement estimates associated with each test suite compounds the complexity. Here, we propose Reinforcement Learning (RL) as a mechanism for finding an optimal solution. We show how interactive learning is used to estimate the true outcome of each selection and to derive an action selection policy that maximises the long-term reward. The estimation methodology and the selection policy are inspired by multi-armed bandit solution strategies. First, we employ a sample-average estimation technique to derive the true outcomes. Second, a variant of the simple greedy algorithm, known as the epsilon-greedy algorithm, is used as the action selection policy. These two steps are applied iteratively until the selection criterion converges. The efficacy of the proposed approach is illustrated on a real-time case study.
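The two steps named in the abstract, sample-average estimation and epsilon-greedy action selection, can be sketched as a generic multi-armed bandit loop. This is only an illustration under stated assumptions, not the authors' implementation: each arm stands in for a candidate test suite, the Gaussian reward model, the mean improvements in `true_means`, the value of `epsilon`, and the fixed step count (used here in place of a convergence check) are all hypothetical.

```python
import random


def epsilon_greedy(reward_fns, steps=5000, epsilon=0.1, seed=0):
    """Generic epsilon-greedy bandit with sample-average value estimates.

    reward_fns: one callable per arm; each takes a random.Random instance
    and returns a noisy reward (hypothetical stand-in for the observed
    reliability improvement of a test suite).
    """
    rng = random.Random(seed)
    n_arms = len(reward_fns)
    estimates = [0.0] * n_arms  # current estimate of each arm's mean reward
    counts = [0] * n_arms       # number of times each arm has been selected

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest current estimate.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = reward_fns[arm](rng)
        counts[arm] += 1
        # Incremental form of the sample average:
        # Q_new = Q_old + (reward - Q_old) / n
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return estimates, counts


# Hypothetical mean reliability improvements for three test suites.
true_means = [0.02, 0.05, 0.03]
suites = [lambda rng, m=m: rng.gauss(m, 0.01) for m in true_means]

estimates, counts = epsilon_greedy(suites)
best = max(range(len(estimates)), key=lambda a: estimates[a])
print("estimates:", [round(q, 4) for q in estimates])
print("selections per suite:", counts)
print("preferred suite index:", best)
```

The incremental update avoids storing past rewards while still computing the exact sample average. A small `epsilon` keeps most selections on the currently best-looking suite while still sampling the others often enough to correct an early misestimate.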
Pages: 25-32
Number of pages: 8