Depth Video-Based Secondary Action Recognition in Vehicles via Convolutional Neural Network and Bidirectional Long Short-Term Memory with Spatial Enhanced Attention Mechanism

Authors
Shao, Weirong [1 ]
Bouazizi, Mondher [2 ]
Ohtsuki, Tomoaki [2 ]
Affiliations
[1] Keio Univ, Grad Sch Sci & Technol, Yokohama 2238522, Japan
[2] Keio Univ, Fac Sci & Technol, Yokohama 2238522, Japan
Keywords
action recognition; deep learning; attention mechanism; depth sensor; MODEL
DOI
10.3390/s24206604
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry];
Discipline classification codes
070302 ; 081704 ;
Abstract
Secondary actions in vehicles are activities that drivers engage in while driving that are not directly related to the primary task of operating the vehicle. Secondary Action Recognition (SAR) in drivers is vital for enhancing road safety and minimizing accidents related to distracted driving. It also plays an important part in modern driving systems such as Advanced Driving Assistance Systems (ADASs), as it helps identify distractions and predict the driver's intent. Traditional methods of action recognition in vehicles mostly rely on RGB videos, which can be significantly degraded by external conditions such as low light. In this research, we introduce a novel method for SAR that uses depth-video data obtained from a depth sensor located in a vehicle. The method uses a Convolutional Neural Network (CNN) enhanced by a Spatial Enhanced Attention Mechanism (SEAM) and combined with Bidirectional Long Short-Term Memory (Bi-LSTM) networks, improving action recognition in depth videos in both the spatial and temporal aspects. We conduct experiments using K-fold cross validation on the public benchmark dataset Drive&Act; the results show a significant improvement in SAR over state-of-the-art methods, with an accuracy of about 84% on depth videos.
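The K-fold cross validation mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the value of K or whether folds are formed per clip or per driver, so the function names `k_fold_splits` and `mean_accuracy`, the shuffling seed, and the per-sample split below are assumptions for illustration only, not the authors' protocol.

```python
import random


def k_fold_splits(n_samples, k, seed=0):
    """Yield (train_indices, test_indices) for each of k folds.

    Hypothetical helper: splits by sample index, which may differ
    from how the paper partitions the Drive&Act data.
    """
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Spread the remainder over the first folds so fold sizes
    # differ by at most one sample.
    base, extra = divmod(n_samples, k)
    start = 0
    for fold in range(k):
        size = base + (1 if fold < extra else 0)
        test = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, test
        start += size


def mean_accuracy(fold_accuracies):
    """Average the per-fold accuracies into a single reported score."""
    return sum(fold_accuracies) / len(fold_accuracies)
```

In such a setup, a model would be trained on `train` and evaluated on `test` for every fold, and the per-fold accuracies would then be averaged with `mean_accuracy`.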
Pages: 23