3D-Pruning: A Model Compression Framework for Efficient 3D Action Recognition

被引：10

作者：

Guo, Jinyang ^{[1
]}

Liu, Jiaheng ^{[2
]}

Xu, Dong ^{[3
]}

机构：

[1] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China

[2] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China

[3] Univ Hong Kong, Dept Comp Sci, Pokfulam, Hong Kong, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2022年 / 32卷 / 12期

关键词：

Point cloud compression; Three-dimensional displays; Computational complexity; Computational modeling; Solid modeling; Task analysis; Feature extraction; Efficient deep learning; point cloud; 3D action recognition; model compression; OBJECT DETECTION; POINT; NETWORKS;

D O I：

10.1109/TCSVT.2022.3197395

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The existing end-to-end optimized 3D action recognition methods often suffer from high computational costs. Observing that different frames and different points in point cloud sequences often have different importance values for the 3D action recognition task, in this work, we propose a fully automatic model compression framework called 3D-Pruning (3DP) for efficient 3D action recognition. After performing model compression by using our 3DP framework, the compressed model can process different frames and different points in each frame by using different computational complexities based on their importance values, in which both the importance value and computational complexity for each frame/point can be automatically learned. Extensive experiments on five benchmark datasets demonstrate the effectiveness of our 3DP framework for model compression.

引用

页码：8717 / 8729

页数：13

共 50 条

[31] Efficient Adversarial Attack Strategy Against 3D Object Detection in Autonomous Driving Systems
Chen, Hai
Yan, Huanqian
Yang, Xiao
Su, Hang
Zhao, Shu
Qian, Fulan
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 16118 - 16132
[32] 3D Features for human action recognition with semi-supervised learning
Sahoo, Suraj Prakash
Srinivasu, Ulli
Ari, Samit
IET IMAGE PROCESSING, 2019, 13 (06) : 983 - 990
[33] Universal Cross-Domain 3D Model Retrieval
Song, Dan
Li, Tian-Bao
Li, Wen-Hui
Nie, Wei-Zhi
Liu, Wu
Liu, An-An
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2721 - 2731
[34] A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes
Yu, Ting
Lin, Xiaojun
Wang, Shuhui
Sheng, Weiguo
Huang, Qingming
Yu, Jun
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1322 - 1338
[35] A New Feature Descriptor for 3D Human Action Recognition
Asadi-Aghbolaghi, Maryam
Ramezanpour, Sadegh
Kasaei, Shohreh
2014 22ND IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2014, : 1157 - 1161
[36] Flattening-Net: Deep Regular 2D Representation for 3D Point Cloud Analysis
Zhang, Qijian
Hou, Junhui
Qian, Yue
Zeng, Yiming
Zhang, Juyong
He, Ying
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9726 - 9742
[37] Efficient Pooling Operator for 3D Morphable Models
Zhang, Haoliang
Cheng, Samuel
El Amm, Christian
Kim, Jonghoon
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (07) : 4225 - 4233
[38] A Measurement Model for Aquatic Animals Based on Instance Segmentation and 3D Point Cloud
He, Zhiqian
Xu, Xiaoqing
Luo, Jialu
Chen, Ziwen
Song, Weibo
Cao, Lijie
Huo, Zhongming
IEEE ACCESS, 2024, 12 : 156208 - 156223
[39] Gaussian Model for 3D Mesh Steganography
Zhu, Jiahao
Zhang, Yushu
Zhang, Xinpeng
Cao, Xiaochun
IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1729 - 1733
[40] VGNet: Multimodal Feature Extraction and Fusion Network for 3D CAD Model Retrieval
Qin, Feiwei
Zhan, Gaoyang
Fang, Meie
Chen, C. L. Philip
Li, Ping
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1432 - 1447

← 1 2 3 4 5 →