DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous Driving

Cited by: 2

Authors:

Dagdanov, Resul [1,2]

Eksen, Feyza [1,3]

Durmus, Halil [4,5]

Yurdakul, Ferhat [1,2]

Ure, Nazim Kemal [1,2]

Affiliations:

[1] Istanbul Tech Univ, ITU Artificial Intelligence & Data Sci Res Ctr, Istanbul, Turkey

[2] Istanbul Tech Univ, Dept Aeronaut Engn, Istanbul, Turkey

[3] Istanbul Tech Univ, Dept Comp Engn, Istanbul, Turkey

[4] Istanbul Tech Univ, Eatron Technol, Istanbul, Turkey

[5] Istanbul Tech Univ, Dept Elect & Commun Engn, Istanbul, Turkey

Source:

2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC) | 2022

Keywords:

Imitation Learning; Reinforcement Learning; Autonomous Driving;

DOI:

10.1109/ITSC55140.2022.9922209

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Safely navigating through an urban environment without violating any traffic rules is a crucial performance target for reliable autonomous driving. In this paper, we present a Reinforcement Learning (RL) based methodology to DEtect and FIX (DeFIX) failures of an Imitation Learning (IL) agent by extracting infraction spots and reconstructing mini-scenarios on these infraction areas to train an RL agent that fixes the shortcomings of the IL approach. DeFIX is a continuous learning framework, where extraction of failure scenarios and training of RL agents are executed in an infinite loop. After each new policy is trained and added to the library of policies, a policy classifier decides which policy to activate at each step during evaluation. It is demonstrated that even with only one RL agent trained on a failure scenario of an IL agent, the DeFIX method is competitive with or outperforms state-of-the-art IL and RL based autonomous urban driving benchmarks. We trained and validated our approach on the most challenging map (Town05) of the CARLA simulator, which involves complex, realistic, and adversarial driving scenarios. The source code is publicly available at https://github.com/data-and-decision-lab/DeFIX
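The loop described in the abstract (evaluate, extract infraction spots, train an RL agent on a mini-scenario, add it to the policy library, select a policy per step) can be illustrated with a toy sketch. This is not the authors' implementation: every name here (PolicyLibrary, evaluate, train_rl_on_mini_scenario, defix_loop) is hypothetical, the route is a list of integer spots, RL training is a placeholder, and the policy classifier is reduced to a dictionary lookup.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set

IL = "il"  # name of the base imitation learning policy (assumed label)


@dataclass
class PolicyLibrary:
    # Maps a failure spot (route index) to the RL policy trained for it.
    rl_policies: Dict[int, str] = field(default_factory=dict)

    def select(self, spot: int) -> str:
        """Toy policy classifier: RL policy at its failure spot, else IL."""
        return self.rl_policies.get(spot, IL)


def evaluate(library: PolicyLibrary, route: List[int],
             il_failures: Set[int]) -> List[int]:
    """Drive the route; return spots where the active policy still fails.

    Assumption: only the IL policy fails, and only at `il_failures`.
    """
    return [s for s in route
            if s in il_failures and library.select(s) == IL]


def train_rl_on_mini_scenario(spot: int) -> str:
    """Placeholder for RL training on a mini-scenario rebuilt around `spot`."""
    return f"rl_{spot}"


def defix_loop(route: List[int], il_failures: Set[int],
               max_rounds: int = 10) -> PolicyLibrary:
    library = PolicyLibrary()
    # The abstract describes an unbounded loop; it is capped here so the
    # sketch terminates.
    for _ in range(max_rounds):
        infractions = evaluate(library, route, il_failures)
        if not infractions:
            break
        spot = infractions[0]  # fix one failure scenario per round
        library.rl_policies[spot] = train_rl_on_mini_scenario(spot)
    return library


library = defix_loop(route=list(range(10)), il_failures={3, 7})
print(library.rl_policies)  # {3: 'rl_3', 7: 'rl_7'}
```

In this sketch each round adds one policy, so the library grows only where the IL agent was observed to fail; a real system would replace the lookup with a learned classifier over observations.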


Pages: 4215-4220

Page count: 6
