Deep State Inference: Toward Behavioral Model Inference of Black-Box Software Systems

被引:1
作者
Ataiefard, Foozhan [1 ]
Mashhadi, Mohammad Jafar [1 ]
Hemmati, Hadi [1 ]
Walkinshaw, Neil [2 ]
机构
[1] Univ Calgary, Calgary, AB T2N 1N4, Canada
[2] Univ Sheffield, Sheffield S10 2TN, England
基金
英国工程与自然科学研究理事会; 加拿大自然科学与工程研究理事会;
关键词
Recurrent neural network; convolutional neural network; deep learning; specification mining; transfer learning; black-box model inference; UAV AutoPilot; time series;
D O I
10.1109/TSE.2021.3128820
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many software engineering tasks, such as testing, debugging, and anomaly detection can benefit from the ability to infer a behavioral model of the software. Most existing inference approaches assume access to code to collect execution sequences. In this paper, we investigate a black-box scenario, where the system under analysis cannot be instrumented in this fashion. This scenario is particularly common when it comes to the analysis of control system logs, which often take the form of continuous signals. In this situation, an execution trace amounts to a multivariate time-series of input and output signals, where different states of the system correspond to different "phases" in the time-series. From an inference perspective, the challenge is to detect when these phase changes take place. Unfortunately, most existing solutions are either univariate, make assumptions about the data distribution, or have limited learning power. In this paper we propose a hybrid deep neural network that accepts as input a multivariate time series and applies a set of convolutional and recurrent layers to learn the non-linear correlations between signals and the patterns over time. We show how this approach can be used to accurately detect state changes, and how the inferred models can be successfully applied to transfer-learning scenarios, to accurately process traces from different products with similar execution characteristics. Our experimental results on two UAV autopilot case studies (one industrial and one open-source) indicate that our approach is highly accurate (over 90% F1 score for state classification) and significantly improves baselines (by up to 102% for change point detection). Using transfer learning we also show that up to 90% of the maximum achievable F1 scores in the open-source case study can be achieved by reusing the trained models from the industrial case and only fine tuning them using as low as 5 labeled samples, which reduces the manual labeling effort by 98%.
引用
收藏
页码:4857 / 4872
页数:16
相关论文
共 44 条
[1]   A survey of methods for time series change point detection [J].
Aminikhanghahi, Samaneh ;
Cook, Diane J. .
KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (02) :339-367
[2]   Mining specifications [J].
Ammons, G ;
Bodík, R ;
Larus, JR .
ACM SIGPLAN NOTICES, 2002, 37 (01) :4-16
[3]   Group lassoing change-points in piecewise-constant AR processes [J].
Angelosante, Daniele ;
Giannakis, Georgios B. .
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2012,
[4]  
[Anonymous], 2008, FEEDBACK SYSTEMS INT
[5]  
[Anonymous], 2013, PROC 30 INT C MACH L
[6]   Testing for and dating common breaks in multivariate time series [J].
Bai, J ;
Lumsdaine, RL ;
Stock, JH .
REVIEW OF ECONOMIC STUDIES, 1998, 65 (03) :395-432
[7]   The Oracle Problem in Software Testing: A Survey [J].
Barr, Earl T. ;
Harman, Mark ;
McMinn, Phil ;
Shahbaz, Muzammil ;
Yoo, Shin .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2015, 41 (05) :507-525
[8]  
Basseville M., 1993, DETECTION ABRUPT CHA
[9]   Recurrent Neural Networks for Multivariate Time Series with Missing Values [J].
Che, Zhengping ;
Purushotham, Sanjay ;
Cho, Kyunghyun ;
Sontag, David ;
Liu, Yan .
SCIENTIFIC REPORTS, 2018, 8
[10]  
Chen J., 2011, PARAMETRIC STAT CHAN