Progressive conditional GAN-based augmentation for 3D object recognition

被引:12
作者
Muzahid, A. A. M. [1 ,2 ]
Wanggen, Wan [1 ,2 ]
Sohel, Ferdous [3 ]
Bennamoun, Mohammed [4 ]
Hou, Li [5 ]
Ullah, Hidayat [1 ,2 ]
机构
[1] Shanghai Univ, Sch Commun & Informat Engn, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Inst Smart City, Shanghai 200444, Peoples R China
[3] Murdoch Univ, Informat Technol, Ctr Crop & Food Innovat, Murdoch, WA 6150, Australia
[4] Univ Western Australia, Dept Comp Sci & Software Engn, Perth, WA 6009, Australia
[5] Huangshan Univ, Sch Informat Engn, Huangshan 245041, Peoples R China
基金
安徽省自然科学基金;
关键词
3D object classification; GAN; Deep learning; Volumetric CNN; NEURAL-NETWORK; CNN; REPRESENTATION; CLASSIFICATION;
D O I
10.1016/j.neucom.2021.06.091
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the 3D object recognition problem from the perspective of the lack of labelled data. In this paper, we propose a novel progressive conditional generative adversarial network (PC-GAN) for 3D object recognition by conditioning the input with progressive learning strategies. PC-GAN is a powerful adversarial model whose generator automatically produces realistic 3D objects with annotations, and the discriminator distinguishes them from the training distribution and recognizes their categories. We train the discriminative classifier simultaneously with the generator to predict the class label by embedding a SoftMax classifier. Progressive learning uses input samples from lower to higher resolutions to increase the generator performance gradually and produce informative objects for a certain class of objects. The key idea of adopting progressing learning is to mitigate overshoots issues of the discriminator and increase variations in the generated objects by learning progressively. This strategy helps the generator to produce more realistic synthetic objects and improve the active classification performance of the discriminator. Our proposed PC-GAN is trained for object classification in a supervised manner and the performance is evaluated on two public datasets. Experimental results demonstrate that our adversarial PCGAN outperforms the existing volumetric discriminative classifiers in term of classification accuracy. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:20 / 30
页数:11
相关论文
共 51 条
[1]  
Brock A., 2016, Generative and discriminative Voxel modeling with convolutional neural networks
[2]  
Chang A.X., 2015, SHAPENET INFORMATION
[3]   3DCapsule: Extending the Capsule Architecture to Classify 3D Point Clouds [J].
Cheraghian, Ali ;
Petersson, Lars .
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, :1194-1202
[4]  
Denton Emily, 2016, ARXIV161106430
[5]   GAN-based synthetic medical image augmentation for increased CNN performance in liver lesion classification [J].
Frid-Adar, Maayan ;
Diamant, Idit ;
Klang, Eyal ;
Amitai, Michal ;
Goldberger, Jacob ;
Greenspan, Hayit .
NEUROCOMPUTING, 2018, 321 :321-331
[6]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[7]   A Comprehensive Performance Evaluation of 3D Local Feature Descriptors [J].
Guo, Yulan ;
Bennamoun, Mohammed ;
Sohel, Ferdous ;
Lu, Min ;
Wan, Jianwei ;
Kwok, Ngai Ming .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 116 (01) :66-89
[8]   3D Object Recognition in Cluttered Scenes with Local Surface Features: A Survey [J].
Guo, Yulan ;
Bennamoun, Mohammed ;
Sohel, Ferdous ;
Lu, Min ;
Wan, Jianwei .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (11) :2270-2287
[9]   Image-Based 3D Object Reconstruction: State-of-the-Art and Trends in the Deep Learning Era [J].
Han, Xian-Feng ;
Laga, Hamid ;
Bennamoun, Mohammed .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) :1578-1604
[10]  
Han ZZ, 2019, AAAI CONF ARTIF INTE, P8376