DDC3N: Doppler-Driven Convolutional 3D Network for Human Action Recognition

被引:2
作者
Toshpulatov, Mukhiddin [1 ]
Lee, Wookey [1 ]
Lee, Suan [2 ]
Yoon, Hoyoung [3 ]
Kang, U. Kang [3 ]
机构
[1] Inha Univ, Biomed Sci & Engn, Incheon 22212, South Korea
[2] Semyung Univ, Sch Comp Sci, Jecheon 27136, South Korea
[3] Seoul Natl Univ, Dept Comp Sci & Engn, Seoul 08826, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
3D pose estimation; discriminator; deep neural network; deep learning; generator; mesh estimation; metadata; skeleton; top-down approach; motion embedding; optical flow map; channel-wise; spatiotemporal; doppler; dataset; action recognition; 2D;
D O I
10.1109/ACCESS.2024.3422428
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In deep learning (DL)-based human action recognition (HAR), considerable strides have been undertaken. Nevertheless, the precise classification of sports athletes' actions still needs to be completed. Primarily attributable to the exigency for exhaustive datasets about sports athletes' actions and the enduring quandaries imposed by variable camera perspectives, mercurial lighting conditions, and occlusions. This investigative endeavor thoroughly examines extant HAR datasets, furnishing a yardstick for gauging the efficacy of cutting-edge methodologies. In light of the paucity of accessible datasets delineating athlete actions, we have taken a proactive stance, endeavoring to curate two meticulously datasets tailored explicitly for sports athletes, subsequently scrutinizing their consequential impact on performance enhancement. While the superiority of 3D convolutional neural networks (3DCNN) over graph convolutional networks (GCN) in HAR is evident, it must be acknowledged that they entail a considerable computational overhead, particularly when confronted with voluminous datasets. Our inquiry introduces innovative methodologies and a more resource-efficient remedy for HAR, thereby alleviating the computational strain on the 3DCNN architecture. Consequently, it proffers a multifaceted approach towards augmenting HAR within the purview of surveillance cameras, bridging lacunae, surmounting computational impediments, and effectuating significant strides in the accuracy and efficacy of HAR frameworks.
引用
收藏
页码:93546 / 93567
页数:22
相关论文
共 84 条
  • [51] Multi-scale spatialtemporal information deep fusion network with temporal pyramid mechanism for video action recognition
    Ou, Hongshi
    Sun, Jifeng
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (03) : 4533 - 4545
  • [52] A Body Part Embedding Model With Datasets for Measuring 2D Human Motion Similarity
    Park, Jonghyuk
    Cho, Sukhyun
    Kim, Dongwoo
    Bailo, Oleksandr
    Park, Heewoong
    Hong, Sanghoon
    Park, Jonghun
    [J]. IEEE ACCESS, 2021, 9 : 36547 - 36558
  • [53] Paulius D, 2020, Arxiv, DOI arXiv:2007.06695
  • [54] Human Pose Estimation-Based Real-Time Gait Analysis Using Convolutional Neural Network
    Rohan, Ali
    Rabah, Mohammed
    Hosny, Tarek
    Kim, Sung-Ho
    [J]. IEEE ACCESS, 2020, 8 : 191542 - 191550
  • [55] Skeleton-Based Action Recognition with Directed Graph Neural Networks
    Shi, Lei
    Zhang, Yifan
    Cheng, Jian
    Lu, Hanqing
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7904 - 7913
  • [56] Human Action Recognition From Various Data Modalities: A Review
    Sun, Zehua
    Ke, Qiuhong
    Rahmani, Hossein
    Bennamoun, Mohammed
    Wang, Gang
    Liu, Jun
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3200 - 3225
  • [57] Human Action Recognition Utilizing Doppler-Enhanced Convolutional 3D Networks
    Toshpulatov, Mukhiddin
    Lee, Wookey
    Tursunbaev, Chingiz
    Lee, Suan
    [J]. 2024 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, IEEE BIGCOMP 2024, 2024, : 475 - 478
  • [58] Talking human face generation: A survey
    Toshpulatov, Mukhiddin
    Lee, Wookey
    Lee, Suan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 219
  • [59] Human pose, hand and mesh estimation using deep learning: a survey
    Toshpulatov, Mukhiddin
    Lee, Wookey
    Lee, Suan
    Roudsari, Arousha Haghighian
    [J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (06) : 7616 - 7654
  • [60] Generative adversarial networks and their application to 3D face generation: A survey
    Toshpulatov, Mukhiddin
    Lee, Wookey
    Lee, Suan
    [J]. IMAGE AND VISION COMPUTING, 2021, 108