Intelligent Decision Making in Autonomous Vehicles using Cognition Aided Reinforcement Learning

被引:3
作者
Rathore, Heena [1 ]
Bhadauria, Vikram [2 ]
机构
[1] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78285 USA
[2] Texas A&M Univ Texarkana, Dept Comp Sci Math MIS, Texarkana, TX USA
来源
2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC) | 2022年
基金
美国国家科学基金会;
关键词
autonomous vehicles; cognition; meta-cognition; reinforcement learning; SERVICE INNOVATION; METACOGNITION;
D O I
10.1109/WCNC51071.2022.9771728
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As recent advances in sensing, computing, and communications expedite proliferation of autonomous vehicles (AV), their sharing the road with human driven vehicles presents a challenge that demands urgent investigation. AVs can excel at deterministic programmed behavior, still human drivers have the edge because of the faculty of cognition, which evolved over millennia. This paper presents Cognition Aided Reinforcement Learning (CARL) algorithm that harnesses inputs from five principles of cognition - memory, attention, language, perception, and intelligence. Sensors build perception, data facilitate memory, and safety messages enable language support. Intelligence fuses information with attention focused on specific actions for reward maximization. Simulation results show CARL to be 10 times faster as compared to the state of the art model-free reinforcement learning algorithms. Additionally, by using the principle of metacognition (art of learning how to learn), CARL achieves optimal rewards in a heterogeneous environment composed of vehicles with varying degrees of autonomy.
引用
收藏
页码:524 / 529
页数:6
相关论文
共 40 条
  • [1] Barrett M, 2015, MIS QUART, V39, P135
  • [2] Bhalla Sushrut, 2020, Advances in Artificial Intelligence. 33rd Canadian Conference on Artificial Intelligence, Canadian AI 2020. Proceedings. Lecture Notes in Artificial Intelligence. Subseries of Lecture Notes in Computer Science (LNAI 12109), P67, DOI 10.1007/978-3-030-47358-7_7
  • [3] Campbell S, 2018, 2018 29TH IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC)
  • [4] Chen JY, 2019, IEEE INT C INTELL TR, P2765, DOI [10.1109/ITSC.2019.8917306, 10.1109/itsc.2019.8917306]
  • [5] Parallel Planning: A New Motion Planning Framework for Autonomous Driving
    Chen, Long
    Hu, Xuemin
    Tian, Wei
    Wang, Hong
    Cao, Dongpu
    Wang, Fei-Yue
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (01) : 236 - 246
  • [6] Multi-level metacognition for adaptive behavior
    Conn, Marvin
    M'Bale, Kenneth
    Josyula, Darsana
    [J]. BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES, 2018, 26 : 174 - 183
  • [7] Davidson A.W., 2006, UNIFYING THEMES COMP, P157
  • [8] Eckstein M. K., 2021, BIORXIV
  • [10] The cognit: A network model of cortical representation
    Fuster, Joaquin M.
    [J]. INTERNATIONAL JOURNAL OF PSYCHOPHYSIOLOGY, 2006, 60 (02) : 125 - 132