Conditional DQN-Based Motion Planning With Fuzzy Logic for Autonomous Driving

被引:69
作者
Chen, Long [1 ,2 ]
Hu, Xuemin [3 ]
Tang, Bo [4 ]
Cheng, Yu [3 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Peoples R China
[2] VIPioneers HuiTuo Inc, Qingdao 266109, Peoples R China
[3] Hubei Univ, Sch Comp Sci & Informat Engn, Wuhan 430062, Peoples R China
[4] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA
基金
中国国家自然科学基金;
关键词
Motion planning; autonomous driving; reinforcement learning; conditional deep Q-network; fussy logic; NETWORKS;
D O I
10.1109/TITS.2020.3025671
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Motion planning is one of the mast significant part in autonomous driving. Learning-based motion planning methods attract many researchers' attention due to the abilities of learning from the environment and directly making decisions from the perception. The deep Q-network, as a popular reinforcement learning method, has achieved great progress in autonomous driving, but these methods seldom use the global path information to handle the issue of directional planning such as making a turning at an intersection since the agent usually learns driving strategies only by the designed reward function, which is difficult to adapt to the driving scenarios of urban roads. Moreover, different motion commands such as the steering wheel and accelerator are associated with each other from classic Q-networks, which easily leads to an unstable prediction of the motion commands since they are independently controlled in a practical driving system. In this paper, a conditional deep Q-network for directional planning is proposed and applied in end-to-end autonomous driving, where the global path is used to guide the vehicle to drive from the origination to the destination. To handle the dependency of different motion commands in Q-networks, we take use of the idea of fuzzy control and develop a defuzzification method to improve the stability of predicting the values of different motion commands. We conduct comprehensive experiments in the CARLA simulator and compare our method with the state-of-the-art methods. Experimental results demonstrate the proposed method achieves better learning performance and driving stability performance than other methods.
引用
收藏
页码:2966 / 2977
页数:12
相关论文
共 34 条
  • [1] [Anonymous], DEEP REINFORCEMENT L
  • [2] Learning Driving Models From Parallel End-to-End Driving Data Set
    Chen, Long
    Wang, Qing
    Lu, Xiankai
    Cao, Dongpu
    Wang, Fei-Yue
    [J]. PROCEEDINGS OF THE IEEE, 2020, 108 (02) : 262 - 273
  • [3] Deep Integration: A Multi-Label Architecture for Road Scene Recognition
    Chen, Long
    Zhan, Wujing
    Tian, Wei
    He, Yuhang
    Zou, Qin
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (10) : 4883 - 4898
  • [4] Parallel Motion Planning: Learning a Deep Planning Model Against Emergencies
    Chen, Long
    Hu, Xuemin
    Tang, Bo
    Cao, Dongpu
    [J]. IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2019, 11 (01) : 36 - 41
  • [5] Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
  • [6] Long-Term Recurrent Convolutional Networks for Visual Recognition and Description
    Donahue, Jeff
    Hendricks, Lisa Anne
    Rohrbach, Marcus
    Venugopalan, Subhashini
    Guadarrama, Sergio
    Saenko, Kate
    Darrell, Trevor
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) : 677 - 691
  • [7] Dosovitskiy Alexey, 2017, PMLR, P1
  • [8] Eraqi Hesham M., 2017, Nips, P1
  • [9] Dynamic path planning for autonomous driving on various roads with avoidance of static and moving obstacles
    Hu, Xuemin
    Chen, Long
    Tang, Bo
    Cao, Dongpu
    He, Haibo
    [J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2018, 100 : 482 - 500
  • [10] Adaptive Fuzzy Behavioral Control of Second-Order Autonomous Agents With Prioritized Missions: Theory and Experiments
    Huang, Jie
    Zhou, Ning
    Cao, Ming
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2019, 66 (12) : 9612 - 9622