Human-Aligned Longitudinal Control for Occluded Pedestrian Crossing With Visual Attention

被引：0

作者：

Asodia, Vinal ^{[1
]}

Feng, Zhenhua ^{[2
]}

Fallah, Saber ^{[1
]}

机构：

[1] Univ Surrey, Dept Mech Engn Sci, Guildford GU2 7XH, Surrey, England

[2] Univ Surrey, Sch Comp Sci & Elect Engn, Guildford GU2 7XH, Surrey, England

来源：

2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024 | 2024年

关键词：

D O I：

10.1109/ICRA57147.2024.10611046

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reinforcement Learning (RL) has been widely used to create generalizable autonomous vehicles. However, they rely on fixed reward functions that struggle to balance values like safety and efficiency. How can autonomous vehicles balance different driving objectives and human values in a constantly changing environment? To bridge this gap, we propose an adaptive reward function that utilizes visual attention maps to detect pedestrians in the driving scene and dynamically switch between prioritizing safety or efficiency depending on the current observation. The visual attention map is used to provide spatial attention to the RL agent to boost the training efficiency of the pipeline. We evaluate the pipeline against variants of an occluded pedestrian crossing scenario in the CARLA Urban Driving simulator. Specifically, the proposed pipeline is compared against a modular setup that combines the well-established object detection model, YOLO, with a Proximal Policy Optimization (PPO) agent. The results indicate that the proposed approach can compete with the modular setup while yielding greater training efficiency. The trajectories collected with the approach confirm the effectiveness of the proposed adaptive reward function.

引用

页码：7419 / 7425

页数：7

共 27 条

[1]

Atakishiyev S., 2021, ARXIV

[2]

Chen Guoxi, 2023, IEEE Trans. on Vehicular Technology

[3] An Integrated Lateral and Longitudinal Decision-Making Model for Autonomous Driving Based on Deep Reinforcement Learning [J].

Cui, Jianxun ;

Zhao, Boyuan ;

Qu, Mingcheng .

JOURNAL OF ADVANCED TRANSPORTATION, 2023, 2023

[4]

Dosovitskiy A, 2017, PR MACH LEARN RES, V78

[5]

Elallid Badr Ben, 2023, ARXIV

[6] Attention mechanisms in computer vision: A survey [J].

Guo, Meng-Hao ;

Xu, Tian-Xing ;

Liu, Jiang-Jiang ;

Liu, Zheng-Ning ;

Jiang, Peng-Tao ;

Mu, Tai-Jiang ;

Zhang, Song-Hai ;

Martin, Ralph R. ;

Cheng, Ming-Ming ;

Hu, Shi-Min .

COMPUTATIONAL VISUAL MEDIA, 2022, 8 (03) :331-368

[7] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[8]

Kim J, 2019, PROC CVPR IEEE, P10583, DOI [10.1109/CVPR.2019.01084, 10.1109/CVP8.2019.01084]

[9] Interpretable Learning for Self-Driving Cars by Visualizing Causal Attention [J].

Kim, Jinkyu ;

Canny, John .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2961-2969

[10] Saliency Heat-Map as Visual Attention for Autonomous Driving Using Generative Adversarial Network (GAN) [J].

Lateef, Fahad ;

Kas, Mohamed ;

Ruichek, Yassine .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (06) :5360-5373

← 1 2 3 →