On Offline Evaluation of Vision-Based Driving Models

被引：56

作者：

Codevilla, Felipe ^{[1
]}

Lopez, Antonio M. ^{[1
]}

Koltun, Vladlen ^{[2
]}

Dosovitskiy, Alexey ^{[3
]}

机构：

[1] Univ Autonoma Barcelona, Comp Vis Ctr, Barcelona, Spain

[2] Intel Labs, Santa Clara, CA USA

[3] Intel Labs, Munich, Germany

来源：

COMPUTER VISION - ECCV 2018, PT 15 | 2018年 / 11219卷

关键词：

Autonomous driving; Deep learning;

D O I：

10.1007/978-3-030-01267-0_15

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Autonomous driving models should ideally be evaluated by deploying them on a fleet of physical vehicles in the real world. Unfortunately, this approach is not practical for the vast majority of researchers. An attractive alternative is to evaluate models offline, on a pre-collected validation dataset with ground truth annotation. In this paper, we investigate the relation between various online and offline metrics for evaluation of autonomous driving models. We find that offline prediction error is not necessarily correlated with driving quality, and two models with identical prediction error can differ dramatically in their driving performance. We show that the correlation of offline evaluation with driving quality can be significantly improved by selecting an appropriate validation dataset and suitable offline metrics.

引用

页码：246 / 262

页数：17

共 27 条

[1]

[Anonymous], 2017, ADV NEURAL INFORM PR

[2]

[Anonymous], Torcs, the open racing car simulator

[3]

[Anonymous], 2005, Advances in neural information processing systems

[4]

Bojarski Mariusz, 2016, arXiv

[5]

Bresson G., 2017, IEEE T INTELL VEH

[6] DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving [J].

Chen, Chenyi ;

Seff, Ari ;

Kornhauser, Alain ;

Xiao, Jianxiong .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2722-2730

[7]

Codevilla F., 2018, INT C ROB AUT ICRA, P1, DOI [DOI 10.1109/ICRA.2018.8460487, 10.1109/ICRA.2018.8460487]

[8] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[9]

Dosovitskiy A., 2017, P 1 ANN C ROB LEARN, P1, DOI DOI 10.48550/ARXIV.1711.03938

[10]

Ebrahimi S., 2017, P C ROB LEARN, P505

← 1 2 3 →