3D-Pruning: A Model Compression Framework for Efficient 3D Action Recognition

Cited by: 10
Authors
Guo, Jinyang [1]
Liu, Jiaheng [2]
Xu, Dong [3]
Affiliations
[1] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[3] Univ Hong Kong, Dept Comp Sci, Pokfulam, Hong Kong, Peoples R China
Keywords
Point cloud compression; Three-dimensional displays; Computational complexity; Computational modeling; Solid modeling; Task analysis; Feature extraction; Efficient deep learning; point cloud; 3D action recognition; model compression; OBJECT DETECTION; POINT; NETWORKS;
DOI
10.1109/TCSVT.2022.3197395
Chinese Library Classification number
TM [Electrical technology]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract
The existing end-to-end optimized 3D action recognition methods often suffer from high computational costs. Observing that different frames and different points in point cloud sequences often have different importance values for the 3D action recognition task, in this work, we propose a fully automatic model compression framework called 3D-Pruning (3DP) for efficient 3D action recognition. After performing model compression by using our 3DP framework, the compressed model can process different frames and different points in each frame by using different computational complexities based on their importance values, in which both the importance value and computational complexity for each frame/point can be automatically learned. Extensive experiments on five benchmark datasets demonstrate the effectiveness of our 3DP framework for model compression.
Pages: 8717-8729
Number of pages: 13