Scenic: a language for scenario specification and data generation

被引:47
作者
Fremont, Daniel J. [1 ]
Kim, Edward [2 ]
Dreossi, Tommaso [3 ]
Ghosh, Shromona [4 ]
Yue, Xiangyu [2 ]
Sangiovanni-Vincentelli, Alberto L. [2 ]
Seshia, Sanjit A. [2 ]
机构
[1] Univ Calif Santa Cruz, Santa Cruz, CA 95064 USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
[3] Insitro, San Francisco, CA USA
[4] Waymo LLC, Mountain View, CA USA
基金
美国国家科学基金会;
关键词
Scenario description language; Synthetic data; Deep learning; Probabilistic programming; Debugging; Automatic test generation; Simulation; D; 2; 5; 3; I; 6; 9;
D O I
10.1007/s10994-021-06120-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new probabilistic programming language for the design and analysis of cyber-physical systems, especially those based on machine learning. We consider several problems arising in the design process, including training a system to be robust to rare events, testing its performance under different conditions, and debugging failures. We show how a probabilistic programming language can help address these problems by specifying distributions encoding interesting types of inputs, then sampling these to generate specialized training and test data. More generally, such languages can be used to write environment models, an essential prerequisite to any formal analysis. In this paper, we focus on systems such as autonomous cars and robots, whose environment at any point in time is a scene, a configuration of physical objects and agents. We design a domain-specific language, Scenic, for describing scenarios that are distributions over scenes and the behaviors of their agents over time. Scenic combines concise, readable syntax for spatiotemporal relationships with the ability to declaratively impose hard and soft constraints over the scenario. We develop specialized techniques for sampling from the resulting distribution, taking advantage of the structure provided by Scenic's domain-specific syntax. Finally, we apply Scenic in multiple case studies for training, testing, and debugging neural networks for perception both as standalone components and within the context of a full cyber-physical system.
引用
收藏
页码:3805 / 3849
页数:45
相关论文
共 70 条
[1]  
Amodei Dario., 2016, CORR
[2]  
Nguyen A, 2015, PROC CVPR IEEE, P427, DOI 10.1109/CVPR.2015.7298640
[3]  
[Anonymous], 2017, ARXIV170600082
[4]  
[Anonymous], 2014, 3 NIPS WORKSH PROB P
[5]  
[Anonymous], 2018, UCBEECS20188
[6]   A survey of cross-validation procedures for model selection [J].
Arlot, Sylvain ;
Celisse, Alain .
STATISTICS SURVEYS, 2010, 4 :40-79
[7]  
Azad A. S., 2021, ARXIVABS210610365 CO
[8]  
Baidu, 2020, AP
[9]  
Broy M., 2005, MODEL BASED TESTING, V1st
[10]  
Claret Guillaume., 2013, Foundations of Software Engineering, P92