Assessing the Safety and Reliability of Autonomous Vehicles from Road Testing

被引:38
作者
Zhao, Xingyu [1 ]
Robu, Valentin [1 ]
Flynn, David [1 ]
Salako, Kizito [2 ]
Strigini, Lorenzo [2 ]
机构
[1] Heriot Watt Univ, Sch Engn & Phys Sci, Edinburgh, Midlothian, Scotland
[2] City Univ London, Ctr Software Reliabil, London, England
来源
2019 IEEE 30TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE) | 2019年
基金
英国工程与自然科学研究理事会;
关键词
autonomous vehicles; reliability claims; statistical testing; safety-critical systems; ultra-high reliability; conservative Bayesian inference; software reliability growth models; ONE CHANNEL; PROBABILITY; SYSTEM; FAILURE; DEMAND;
D O I
10.1109/ISSRE.2019.00012
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
There is an urgent societal need to assess whether autonomous vehicles (AVs) are safe enough. From published quantitative safety and reliability assessments of AVs, we know that, given the goal of predicting very low rates of accidents, road testing alone requires infeasible numbers of miles to be driven. However, previous analyses do not consider any knowledge prior to road testing - knowledge which could bring substantial advantages if the AV design allows strong expectations of safety before road testing. We present the advantages of a new variant of Conservative Bayesian Inference (CBI), which uses prior knowledge while avoiding optimistic biases. We then study the trend of disengagements (take-overs by human drivers) by applying Software Reliability Growth Models (SRGMs) to data from Waymo's public road testing over 51 months, in view of the practice of software updates during this testing. Our approach is to not trust any specific SRGM, but to assess forecast accuracy and then improve forecasts. We show that, coupled with accuracy assessment and recalibration techniques, SRGMs could be a valuable test planning aid.
引用
收藏
页码:13 / 23
页数:11
相关论文
共 48 条
[1]   EVALUATION OF COMPETING SOFTWARE-RELIABILITY PREDICTIONS [J].
ABDELGHALY, AA ;
CHAN, PY ;
LITTLEWOOD, B .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1986, 12 (09) :950-967
[2]  
Anderson James M., 2016, RR-443-2-RC
[3]  
Atwood C., 2003, REPORT NUREG CR 6823
[4]   The Moral Machine experiment [J].
Awad, Edmond ;
Dsouza, Sohan ;
Kim, Richard ;
Schulz, Jonathan ;
Henrich, Joseph ;
Shariff, Azim ;
Bonnefon, Jean-Francois ;
Rahwan, Iyad .
NATURE, 2018, 563 (7729) :59-+
[5]   Hands Off the Wheel in Autonomous Vehicles? A Systems Perspective on over a Million Miles of Field Data [J].
Banerjee, Subho S. ;
Jha, Saurabh ;
Cyriac, James ;
Kalbarczyk, Zbigniew T. ;
Iyer, Ravishankar K. .
2018 48TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN), 2018, :586-597
[6]   A SOFTWARE-RELIABILITY MODEL FOR ARTIFICIAL-INTELLIGENCE PROGRAMS [J].
BASTANI, FB ;
CHEN, IR ;
TSAO, TW .
INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 1993, 3 (01) :99-114
[7]   Could Fisher, Jeffreys and Neyman have agreed on testing? [J].
Berger, JO .
STATISTICAL SCIENCE, 2003, 18 (01) :1-12
[8]   Deriving a frequentist conservative confidence bound for probability of failure per demand for systems with different operational and test profiles [J].
Bishop, Peter ;
Povyakalo, Andrey .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2017, 158 :246-253
[9]   Toward a Formalism for Conservative Claims about the Dependability of Software-Based Systems [J].
Bishop, Peter ;
Bloomfield, Robin ;
Littlewood, Bev ;
Povyakalo, Andrey ;
Wright, David .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2011, 37 (05) :708-717
[10]   The social dilemma of autonomous vehicles [J].
Bonnefon, Jean-Francois ;
Shariff, Azim ;
Rahwan, Iyad .
SCIENCE, 2016, 352 (6293) :1573-1576