Multi-View CNN Feature Aggregation with ELM Auto-Encoder for 3D Shape Recognition

被引:0
|
作者
Zhi-Xin Yang
Lulu Tang
Kun Zhang
Pak Kin Wong
机构
[1] University of Macau,Department of Electromechanical Engineering, Faculty of Science and Technology
来源
Cognitive Computation | 2018年 / 10卷
关键词
ELM auto-encoder; Convolutional neural networks; 3D shape recognition; Multi-view feature aggregation;
D O I
暂无
中图分类号
学科分类号
摘要
Fast and accurate detection of 3D shapes is a fundamental task of robotic systems for intelligent tracking and automatic control. View-based 3D shape recognition has attracted increasing attention because human perceptions of 3D objects mainly rely on multiple 2D observations from different viewpoints. However, most existing multi-view-based cognitive computation methods use straightforward pairwise comparisons among the projected images then follow with weak aggregation mechanism, which results in heavy computation cost and low recognition accuracy. To address such problems, a novel network structure combining multi-view convolutional neural networks (M-CNNs), extreme learning machine auto-encoder (ELM-AE), and ELM classifer, named as MCEA, is proposed for comprehensive feature learning, effective feature aggregation, and efficient classification of 3D shapes. Such novel framework exploits the advantages of deep CNN architecture with the robust ELM-AE feature representation, as well as the fast ELM classifier for 3D model recognition. Compared with the existing set-to-set image comparison methods, the proposed shape-to-shape matching strategy could convert each high informative 3D model into a single compact feature descriptor via cognitive computation. Moreover, the proposed method runs much faster and obtains a good balance between classification accuracy and computational efficiency. Experimental results on the benchmarking Princeton ModelNet, ShapeNet Core 55, and PSB datasets show that the proposed framework achieves higher classification and retrieval accuracy in much shorter time than the state-of-the-art methods.
引用
收藏
页码:908 / 921
页数:13
相关论文
共 32 条
  • [31] Semi- and Self-supervised Multi-view Fusion of 3D Microscopy Images Using Generative Adversarial Networks
    Yang, Canyu
    Eschweiler, Dennis
    Stegmaier, Johannes
    MACHINE LEARNING FOR MEDICAL IMAGE RECONSTRUCTION (MLMIR 2021), 2021, 12964 : 130 - 139
  • [32] MS-TCNet: An effective Transformer-CNN combined network using multi-scale feature learning for 3D medical image segmentation
    Ao, Yu
    Shi, Weili
    Ji, Bai
    Miao, Yu
    He, Wei
    Jiang, Zhengang
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 170