Multi-Input Fusion for Practical Pedestrian Intention Prediction

被引:23
|
作者
Singh, Ankur [1 ,2 ]
Suddamalla, Upendra [1 ]
机构
[1] Moovita Pte Ltd, Singapore, Singapore
[2] Indian Inst Technol Kanpur, Kanpur, Uttar Pradesh, India
关键词
D O I
10.1109/ICCVW54120.2021.00260
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pedestrians are the most vulnerable road users and are at a high risk of fatal accidents. Accurate pedestrian detection and effectively analyzing their intentions to cross the road are critical for autonomous vehicles and ADAS solutions to safely navigate public roads. Faster and precise estimation of pedestrian intention helps in adopting safe driving behavior. Visual pose and motion are two important cues that have been previously employed to determine pedestrian intention. However, motion patterns can give erroneous results for short-term video sequences and are thus prone to mistakes. In this work, we propose an intention prediction network that utilizes pedestrian bounding boxes, pose, bounding box coordinates, and takes advantage of global context along with the local setting. This network implicitly learns pedestrians' motion cues and location information to differentiate between a crossing and a non-crossing pedestrian. We experiment with different combinations of input features and propose multiple efficient models in terms of accuracy and inference speeds. Our best-performing model shows around 85% accuracy on the JAAD dataset.
引用
收藏
页码:2304 / 2311
页数:8
相关论文
共 50 条
  • [21] MULTI-INPUT ANALOG COMPARTORS
    EREMEEV, IS
    MEASUREMENT TECHNIQUES-USSR, 1970, (11): : 1713 - &
  • [22] A Novel Design Approach for Multi-input XOR Gate Using Multi-input Majority Function
    Esam Alkaldy
    Keivan Navi
    Fazel Sharifi
    Arabian Journal for Science and Engineering, 2014, 39 : 7923 - 7932
  • [23] Multi-input Fusion Spelling Error Correction Model Based on Contrast Optimization
    Wu Y.
    Huang R.
    Bai R.
    Cao J.
    Zhao J.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2024, 37 (01): : 85 - 94
  • [24] A Multi-Input Fusion Model for Privacy and Semantic Preservation in Facial Image Datasets
    Yang, Yuanzhe
    Niu, Zhiyi
    Qiu, Yuying
    Song, Biao
    Zhang, Xinchang
    Tian, Yuan
    APPLIED SCIENCES-BASEL, 2023, 13 (11):
  • [25] A Novel Design Approach for Multi-input XOR Gate Using Multi-input Majority Function
    Alkaldy, Esam
    Navi, Keivan
    Sharifi, Fazel
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2014, 39 (11) : 7923 - 7932
  • [26] TAT: Pedestrian Intention and Trajectory Prediction
    Su, Shi
    Guo, Fengpeng
    Chen, Zhuanghao
    Huang, Hongcheng
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4182 - 4185
  • [27] Protein Secondary Structure Prediction using Multi-input Convolutional Neural Network
    Jalal, Shayan Ihsan
    Zhong, Jiling
    Kumar, Suman
    2019 IEEE SOUTHEASTCON, 2019,
  • [28] Multi-input performance prediction models for flexible pavements using LTPP database
    Younos, M. A.
    Abd El-Hakim, R. T.
    El-Badawy, S. M.
    Afify, H. A.
    INNOVATIVE INFRASTRUCTURE SOLUTIONS, 2020, 5 (01)
  • [29] Multi-input performance prediction models for flexible pavements using LTPP database
    M. A. Younos
    R. T. Abd El-Hakim
    S. M. El-Badawy
    H. A. Afify
    Innovative Infrastructure Solutions, 2020, 5
  • [30] Investigation of multi-input convolutional neural networks for the prediction of particleboard mechanical properties
    Chen, Shuoye
    Sakai, Shunsuke
    Matsuo-Ueda, Miyuki
    Umemura, Kenji
    SCIENTIFIC REPORTS, 2025, 15 (01):