Topology-Aware Graph Convolution Network for Few-Shot Incremental 3-D Object Learning

被引:0
作者
Ma, Bingtao [1 ,2 ,3 ]
Cong, Yang [1 ,2 ]
Dong, Jiahua [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Peoples R China
[2] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110016, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2024年 / 54卷 / 01期
基金
中国国家自然科学基金;
关键词
3-D meshes; class incremental learning; few-shot; graph convolution network (GCN); three-dimensional (3-D) object recognition;
D O I
10.1109/TSMC.2023.3302008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Three-dimensional (3-D) object recognition has achieved satisfied achievement in both academia and industry. However, most traditional 3-D object classification methods implicitly assume that there are abundant training data from a static distribution. To relax the assumption, we target on a more challenging and realistic setting: few-shot incremental 3-D object learning (FSI3DL), which intends to incrementally classify the new coming 3-D objects with few training data. In order to achieve this, two key challenges need to be concerned: 1) the catastrophic forgetting issue caused by incremental 3-D data with irregular and redundant topological structures and 2) the overfitting issue caused by few-shot training data. To address the first challenge, we use Laplacian spectral analysis based on 3-D meshes to design an embedding network that consists of super-vertex graph convolution (SVGC) module and topology-aware graph attention (TAGA) module. The SVGC is designed to construct the discriminative local topological characteristics for representing the irregular 3-D meshes better. The TAGA is designed to identify redundant topological characteristics. To address the second challenge, a fine-tuning strategy with model alignment regularization is investigated. Furthermore, an embedding space selection and fusion (ESSF) strategy is proposed in the inference phase to mitigate catastrophic forgetting and overfitting further. Combining SVGC, TAGA, and alignment regularization with ESSF strategy, a novel topology-aware graph convolution network (TopGCN) is proposed to address the FSI3DL. Experiments on representative 3-D classification datasets validate the superiority of TopGCN.
引用
收藏
页码:324 / 337
页数:14
相关论文
共 60 条
  • [1] Ahmed E., 2018, arXiv
  • [2] Expert Gate: Lifelong Learning with a Network of Experts
    Aljundi, Rahaf
    Chakravarty, Punarjay
    Tuytelaars, Tinne
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7120 - 7129
  • [3] [Anonymous], 2015, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2015.7298801
  • [4] End-to-End Incremental Learning
    Castro, Francisco M.
    Marin-Jimenez, Manuel J.
    Guil, Nicolas
    Schmid, Cordelia
    Alahari, Karteek
    [J]. COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 241 - 257
  • [5] Chang Angel X., 2015, arXiv
  • [6] Riemannian Walk for Incremental Learning: Understanding Forgetting and Intransigence
    Chaudhry, Arslan
    Dokania, Puneet K.
    Ajanthan, Thalaiyasingam
    Torr, Philip H. S.
    [J]. COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 : 556 - 572
  • [7] Deep Meta Metric Learning
    Chen, Guangyi
    Zhang, Tianren
    Lu, Jiwen
    Zhou, Jie
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9546 - 9555
  • [8] Chen K., 2021, P ICLR
  • [9] Semantic-aware Knowledge Distillation for Few-Shot Class-Incremental Learning
    Cheraghian, Ali
    Rahman, Shafin
    Fang, Pengfei
    Roy, Soumava Kumar
    Petersson, Lars
    Harandi, Mehrtash
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2534 - 2543
  • [10] A Comprehensive Study of 3-D Vision-Based Robot Manipulation
    Cong, Yang
    Chen, Ronghan
    Ma, Bingtao
    Liu, Hongsen
    Hou, Dongdong
    Yang, Chenguang
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1682 - 1698