Multi-Input Fusion for Practical Pedestrian Intention Prediction

被引：23

作者：

Singh, Ankur ^{[1
,2
]}

Suddamalla, Upendra ^{[1
]}

机构：

[1] Moovita Pte Ltd, Singapore, Singapore

[2] Indian Inst Technol Kanpur, Kanpur, Uttar Pradesh, India

来源：

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021) | 2021年

关键词：

D O I：

10.1109/ICCVW54120.2021.00260

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Pedestrians are the most vulnerable road users and are at a high risk of fatal accidents. Accurate pedestrian detection and effectively analyzing their intentions to cross the road are critical for autonomous vehicles and ADAS solutions to safely navigate public roads. Faster and precise estimation of pedestrian intention helps in adopting safe driving behavior. Visual pose and motion are two important cues that have been previously employed to determine pedestrian intention. However, motion patterns can give erroneous results for short-term video sequences and are thus prone to mistakes. In this work, we propose an intention prediction network that utilizes pedestrian bounding boxes, pose, bounding box coordinates, and takes advantage of global context along with the local setting. This network implicitly learns pedestrians' motion cues and location information to differentiate between a crossing and a non-crossing pedestrian. We experiment with different combinations of input features and propose multiple efficient models in terms of accuracy and inference speeds. Our best-performing model shows around 85% accuracy on the JAAD dataset.

引用

页码：2304 / 2311

页数：8

共 50 条

[21] MULTI-INPUT ANALOG COMPARTORS
EREMEEV, IS
MEASUREMENT TECHNIQUES-USSR, 1970, (11): : 1713 - &
[22] A Novel Design Approach for Multi-input XOR Gate Using Multi-input Majority Function
Esam Alkaldy
Keivan Navi
Fazel Sharifi
Arabian Journal for Science and Engineering, 2014, 39 : 7923 - 7932
[23] Multi-input Fusion Spelling Error Correction Model Based on Contrast Optimization
Wu Y.
Huang R.
Bai R.
Cao J.
Zhao J.
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2024, 37 (01): : 85 - 94
[24] A Multi-Input Fusion Model for Privacy and Semantic Preservation in Facial Image Datasets
Yang, Yuanzhe
Niu, Zhiyi
Qiu, Yuying
Song, Biao
Zhang, Xinchang
Tian, Yuan
APPLIED SCIENCES-BASEL, 2023, 13 (11):
[25] A Novel Design Approach for Multi-input XOR Gate Using Multi-input Majority Function
Alkaldy, Esam
Navi, Keivan
Sharifi, Fazel
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2014, 39 (11) : 7923 - 7932
[26] TAT: Pedestrian Intention and Trajectory Prediction
Su, Shi
Guo, Fengpeng
Chen, Zhuanghao
Huang, Hongcheng
2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4182 - 4185
[27] Protein Secondary Structure Prediction using Multi-input Convolutional Neural Network
Jalal, Shayan Ihsan
Zhong, Jiling
Kumar, Suman
2019 IEEE SOUTHEASTCON, 2019,
[28] Multi-input performance prediction models for flexible pavements using LTPP database
Younos, M. A.
Abd El-Hakim, R. T.
El-Badawy, S. M.
Afify, H. A.
INNOVATIVE INFRASTRUCTURE SOLUTIONS, 2020, 5 (01)
[29] Multi-input performance prediction models for flexible pavements using LTPP database
M. A. Younos
R. T. Abd El-Hakim
S. M. El-Badawy
H. A. Afify
Innovative Infrastructure Solutions, 2020, 5
[30] Investigation of multi-input convolutional neural networks for the prediction of particleboard mechanical properties
Chen, Shuoye
Sakai, Shunsuke
Matsuo-Ueda, Miyuki
Umemura, Kenji
SCIENTIFIC REPORTS, 2025, 15 (01):

← 1 2 3 4 5 →