PVFAN: Point-view fusion attention network for 3D shape recognition

Cited by: 0
Authors
Cao, Jiangzhong [1]
Liao, Siyi [1]
Affiliations
[1] Guangdong Univ Technol, Sch Informat Engn, Guangzhou, Peoples R China
Keywords
3D shape recognition; multimodal feature fusion; feature refinement; attention mechanism; CLASSIFICATION; RETRIEVAL; DEPTH
DOI
10.3233/JIFS-232800
CLC number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
3D shape recognition is a critical research topic in the field of computer vision, attracting substantial attention. Existing approaches mainly focus on extracting distinctive 3D shape features; however, they often neglect the model's robustness and lack refinement in deep features. To address these limitations, we propose the point-view fusion attention network that aims to extract a concise, informative, and robust 3D shape descriptor. Initially, our approach combines multi-view features with point cloud features to obtain accurate and distinguishable fusion features. To effectively handle these fusion features, we design a dual-attention convolutional network which consists of a channel attention module and a spatial attention module. This dual-attention mechanism greatly enhances the generalization ability and robustness of 3D recognition models. Notably, we introduce a strip-pooling layer in the channel attention module to refine the features, resulting in improved fusion features that are more compact. Finally, a classification process is performed on the refined features to assign appropriate 3D shape labels. Our extensive experiments on the ModelNet10 and ModelNet40 datasets for 3D shape recognition and retrieval demonstrate the remarkable accuracy and robustness of the proposed method.
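
The pipeline outlined in the abstract (fuse multi-view and point cloud features, then refine them with channel and spatial attention) can be illustrated with a minimal, parameter-free sketch in plain Python. This is not the authors' implementation: the abstract does not specify layer shapes, learned weights, or how the modules are ordered. The function names (`fuse`, `strip_pool`, `channel_attention`, `spatial_attention`, `dual_attention`), the max-pooling over views, the use of the largest strip average as a channel summary, and the channel-then-spatial order are all assumptions made for illustration; a real network would replace the fixed sigmoid gates with learned convolutional or fully connected layers.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def fuse(view_feats, point_feat):
    """Assumed fusion: element-wise max over views, then concatenation."""
    pooled = [max(col) for col in zip(*view_feats)]
    return pooled + list(point_feat)


def strip_pool(channel):
    """Average an H x W map along horizontal and vertical strips."""
    h, w = len(channel), len(channel[0])
    row_means = [sum(row) / w for row in channel]
    col_means = [sum(channel[i][j] for i in range(h)) / h for j in range(w)]
    return row_means, col_means


def channel_attention(fmap):
    """Scale each channel of a C x H x W map by one gate in (0, 1).

    Assumption: the gate is the sigmoid of the largest strip average;
    a trained module would learn this mapping instead.
    """
    out = []
    for channel in fmap:
        rows, cols = strip_pool(channel)
        gate = sigmoid(max(rows + cols))
        out.append([[gate * v for v in row] for row in channel])
    return out


def spatial_attention(fmap):
    """Scale each location by the sigmoid of its mean across channels."""
    c, h, w = len(fmap), len(fmap[0]), len(fmap[0][0])
    gate = [
        [sigmoid(sum(fmap[k][i][j] for k in range(c)) / c) for j in range(w)]
        for i in range(h)
    ]
    return [
        [[fmap[k][i][j] * gate[i][j] for j in range(w)] for i in range(h)]
        for k in range(c)
    ]


def dual_attention(fmap):
    """Assumed order: channel attention first, then spatial attention."""
    return spatial_attention(channel_attention(fmap))
```

Calling `fuse` on a list of per-view vectors and one point cloud vector yields a single concatenated descriptor, while `dual_attention` takes a nested `C x H x W` list and returns a map of the same shape in which every value has been scaled by gates between 0 and 1.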
Pages: 8119-8133
Page count: 15