An Efficient Self-Evolution Method of Autonomous Driving for Any Given Algorithm

被引:12
作者
Huang, Yanjun [1 ,2 ]
Yang, Shuo [1 ,3 ]
Wang, Liwen [1 ]
Yuan, Kang [3 ,4 ]
Zheng, Hongyu [5 ]
Chen, Hong [4 ]
机构
[1] Tongji Univ, Sch Automot Studies, Shanghai 201804, Peoples R China
[2] Frontiers Sci Ctr Intelligent Autonomous Syst, Shanghai 200120, Peoples R China
[3] Tongji Univ, Shanghai Inst Intelligent Sci & Technol, Shanghai 201804, Peoples R China
[4] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China
[5] Jilin Univ, State Key Lab Automot Simulat & Control, Changchun 130022, Peoples R China
关键词
Autonomous driving; reinforcement learning; policy improvement;
D O I
10.1109/TITS.2023.3307873
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Autonomous vehicles are expected to achieve self-evolution in the real-world environment to gradually cover more complex and changing scenarios. Reinforcement learning focuses on how agents act in the environment to maximize the cumulative reward, with a great potential to achieve self-evolution ability. However, most of reinforcement learning algorithms suffer from a low sample efficiency, which greatly limits their application in autonomous driving. This paper presents an efficient self-evolution method for any given algorithm based on the combination of Soft Actor Critic (SAC) and Behavioral Cloning(BC). First, the states of the sample trajectory in the replay buffer are separated and input into the given algorithm (algorithm with fundamental performance) to get the output label of actions such that the SAC algorithm can be guided using BC to achieve fast iteration in the direction of optimization with existing basic performance. Then, the value iteration algorithm is combined to achieve the proportion allocation of mixed gradient feedback, in order to trade off exploitation and exploration. In addition, the proposed methodology is evaluated in simulation environment taking automated speed control as an example. Experiment results show that compared with SAC algorithm, the proposed method can realize more than three times of convergence efficiency improvement, while without destroying the exploration enhancement advantage of reinforcement learning algorithm, that is, the performance is improved by 20% compared with the given algorithm (Intelligent Driver Model, IDM). The proposed method can easily extended to improve any given model no matter it is model-based or learning-based algorithm.
引用
收藏
页码:602 / 612
页数:11
相关论文
共 14 条
  • [1] A safe self-evolution algorithm for autonomous driving based on data-driven risk quantification model
    Yang, Shuo
    Li, Shizhen
    Huang, Yanjun
    Chen, Hong
    ACCIDENT ANALYSIS AND PREVENTION, 2025, 214
  • [2] Self-evolution Scenarios for Simulation Tests of Autonomous Vehicles Based on Different Models of Driving Styles
    Ma Y.-N.
    Jiang W.
    Wu J.-Y.
    Chen J.-Y.
    Li N.
    Xu Z.-G.
    Xiong L.
    Zhongguo Gonglu Xuebao/China Journal of Highway and Transport, 2023, 36 (02): : 216 - 228
  • [3] A Fair and Efficient Federated Learning Algorithm for Autonomous Driving
    Tang, Xinlong
    Zhang, Jiayi
    Fu, Yuchuan
    Li, Changle
    Cheng, Nan
    Yuan, Xiaoming
    2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
  • [4] Self-supervised binocular depth estimation algorithm with self-rectification for autonomous driving
    Bao, Jingyao
    Yu, Hongfei
    Zou, Yongjia
    Lv, Jin
    Liu, Wei
    Cao, Yang
    IET INTELLIGENT TRANSPORT SYSTEMS, 2024, 18 (08) : 1445 - 1458
  • [5] SeFlow: A Self-supervised Scene Flow Method in Autonomous Driving
    Zhang, Qingwen
    Yang, Yi
    Li, Peizheng
    Andersson, Olov
    Jensfelt, Patric
    COMPUTER VISION-ECCV 2024, PT I, 2025, 15059 : 353 - 369
  • [6] Efficient 3D Object Detection Models and Evaluation Method for Autonomous Driving
    Lee, Jin-Hee
    Lee, Jae-Keun
    Lee, Joohyun
    Kim, Je-Seok
    Kwon, Soon
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [7] End-to-End Autonomous Driving Decision Method Based on Improved TD3 Algorithm in Complex Scenarios
    Xu, Tao
    Meng, Zhiwei
    Lu, Weike
    Tong, Zhongwen
    SENSORS, 2024, 24 (15)
  • [8] An efficient planning method based on deep reinforcement learning with hybrid actions for autonomous driving on highway
    Mei Zhang
    Kai Chen
    Jinhui Zhu
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 3483 - 3499
  • [9] An efficient planning method based on deep reinforcement learning with hybrid actions for autonomous driving on highway
    Zhang, Mei
    Chen, Kai
    Zhu, Jinhui
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (10) : 3483 - 3499
  • [10] Dynamic obstacle avoidance path planning method for autonomous driving based on quantum ant colony algorithm
    Yao, Y.
    Wang, A.J.
    Shang, F.M.
    Advances in Transportation Studies, 2024, 2 (Special issue): : 29 - 40