Conditional DQN-Based Motion Planning With Fuzzy Logic for Autonomous Driving

Cited by: 86

Authors:

Chen, Long ^{[1,2]}

Hu, Xuemin ^{[3]}

Tang, Bo ^{[4]}

Cheng, Yu ^{[3]}

Affiliations:

[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Peoples R China

[2] VIPioneers HuiTuo Inc, Qingdao 266109, Peoples R China

[3] Hubei Univ, Sch Comp Sci & Informat Engn, Wuhan 430062, Peoples R China

[4] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2022 / Vol. 23 / No. 04

Funding:

National Natural Science Foundation of China;

Keywords:

Motion planning; autonomous driving; reinforcement learning; conditional deep Q-network; fuzzy logic; NETWORKS;

DOI:

10.1109/TITS.2020.3025671

Chinese Library Classification (CLC) number:

TU [Building Science];

Discipline classification code:

0813 ;

Abstract:

Motion planning is one of the most significant parts of autonomous driving. Learning-based motion planning methods attract many researchers' attention because they can learn from the environment and make decisions directly from perception. The deep Q-network, a popular reinforcement learning method, has achieved great progress in autonomous driving, but such methods seldom use global path information to handle directional planning, such as making a turn at an intersection, since the agent usually learns driving strategies only from the designed reward function, which makes it difficult to adapt to urban driving scenarios. Moreover, different motion commands such as the steering wheel and accelerator are associated with each other in classic Q-networks, which easily leads to unstable prediction of the motion commands since they are independently controlled in a practical driving system. In this paper, a conditional deep Q-network for directional planning is proposed and applied in end-to-end autonomous driving, where the global path is used to guide the vehicle from the origin to the destination. To handle the dependency of different motion commands in Q-networks, we make use of the idea of fuzzy control and develop a defuzzification method to improve the stability of predicting the values of different motion commands. We conduct comprehensive experiments in the CARLA simulator and compare our method with state-of-the-art methods. Experimental results demonstrate that the proposed method achieves better learning performance and driving stability than the other methods.
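The abstract names two ideas, conditioning the Q-network on the global path and defuzzifying the predicted values into a motion command, without giving their exact form. The following is only a minimal sketch of how such a pipeline could look, not the authors' implementation. The steering bins, the command names, the per-command dictionary of Q-values, and the softmax-weighted centroid rule are all assumptions introduced here for illustration.

```python
import math

# Hypothetical discrete steering bins, normalized to [-1, 1] (assumption, not from the paper).
STEER_BINS = [-1.0, -0.5, 0.0, 0.5, 1.0]


def select_branch(q_by_command, command):
    """Conditional step: keep only the Q-values of the branch that matches
    the high-level command taken from the global path (e.g. "left")."""
    if command not in q_by_command:
        raise KeyError(f"unknown command: {command}")
    return q_by_command[command]


def defuzzify(q_values, bins, temperature=1.0):
    """Centroid-style defuzzification (assumed rule).

    Q-values are converted to membership weights with a softmax, and the
    output command is the weighted mean of the bin centers. This gives a
    continuous value instead of the single bin chosen by argmax.
    """
    if len(q_values) != len(bins):
        raise ValueError("q_values and bins must have the same length")
    q_max = max(q_values)  # subtract the maximum for numerical stability
    weights = [math.exp((q - q_max) / temperature) for q in q_values]
    total = sum(weights)
    return sum(w * b for w, b in zip(weights, bins)) / total


# Hypothetical Q-values for two command branches of one network output.
q_by_command = {
    "follow": [0.0, 1.0, 3.0, 1.0, 0.0],
    "right": [0.0, 0.0, 0.5, 2.0, 3.0],
}
steer_follow = defuzzify(select_branch(q_by_command, "follow"), STEER_BINS)
steer_right = defuzzify(select_branch(q_by_command, "right"), STEER_BINS)
```

In this sketch the "follow" branch has Q-values symmetric around the center bin, so the defuzzified steering is approximately zero, while the "right" branch yields a positive steering value that lies between bins rather than snapping to the argmax bin.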


Pages: 2966-2977

Page count: 12

References: 34 in total

[1] [Anonymous], DEEP REINFORCEMENT L

[2] Learning Driving Models From Parallel End-to-End Driving Data Set [J]. Chen, Long; Wang, Qing; Lu, Xiankai; Cao, Dongpu; Wang, Fei-Yue. PROCEEDINGS OF THE IEEE, 2020, 108 (02): 262-273

[3] Deep Integration: A Multi-Label Architecture for Road Scene Recognition [J]. Chen, Long; Zhan, Wujing; Tian, Wei; He, Yuhang; Zou, Qin. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (10): 4883-4898

[4] Parallel Motion Planning: Learning a Deep Planning Model Against Emergencies [J]. Chen, Long; Hu, Xuemin; Tang, Bo; Cao, Dongpu. IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2019, 11 (01): 36-41

[5] Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274

[6] Long-Term Recurrent Convolutional Networks for Visual Recognition and Description [J]. Donahue, Jeff; Hendricks, Lisa Anne; Rohrbach, Marcus; Venugopalan, Subhashini; Guadarrama, Sergio; Saenko, Kate; Darrell, Trevor. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04): 677-691

[7] Dosovitskiy A, 2017, PR MACH LEARN RES, V78

[8] Eraqi Hesham M., 2017, Nips, P1

[9] Dynamic path planning for autonomous driving on various roads with avoidance of static and moving obstacles [J]. Hu, Xuemin; Chen, Long; Tang, Bo; Cao, Dongpu; He, Haibo. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2018, 100: 482-500

[10] Adaptive Fuzzy Behavioral Control of Second-Order Autonomous Agents With Prioritized Missions: Theory and Experiments [J]. Huang, Jie; Zhou, Ning; Cao, Ming. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2019, 66 (12): 9612-9622
