Topology-Aware Graph Convolution Network for Few-Shot Incremental 3-D Object Learning

被引：0

作者：

Ma, Bingtao ^{[1
,2
,3
]}

Cong, Yang ^{[1
,2
]}

Dong, Jiahua ^{[1
,2
,3
]}

机构：

[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Peoples R China

[2] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110016, Peoples R China

[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2024年 / 54卷 / 01期

基金：

中国国家自然科学基金;

关键词：

3-D meshes; class incremental learning; few-shot; graph convolution network (GCN); three-dimensional (3-D) object recognition;

D O I：

10.1109/TSMC.2023.3302008

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Three-dimensional (3-D) object recognition has achieved satisfied achievement in both academia and industry. However, most traditional 3-D object classification methods implicitly assume that there are abundant training data from a static distribution. To relax the assumption, we target on a more challenging and realistic setting: few-shot incremental 3-D object learning (FSI3DL), which intends to incrementally classify the new coming 3-D objects with few training data. In order to achieve this, two key challenges need to be concerned: 1) the catastrophic forgetting issue caused by incremental 3-D data with irregular and redundant topological structures and 2) the overfitting issue caused by few-shot training data. To address the first challenge, we use Laplacian spectral analysis based on 3-D meshes to design an embedding network that consists of super-vertex graph convolution (SVGC) module and topology-aware graph attention (TAGA) module. The SVGC is designed to construct the discriminative local topological characteristics for representing the irregular 3-D meshes better. The TAGA is designed to identify redundant topological characteristics. To address the second challenge, a fine-tuning strategy with model alignment regularization is investigated. Furthermore, an embedding space selection and fusion (ESSF) strategy is proposed in the inference phase to mitigate catastrophic forgetting and overfitting further. Combining SVGC, TAGA, and alignment regularization with ESSF strategy, a novel topology-aware graph convolution network (TopGCN) is proposed to address the FSI3DL. Experiments on representative 3-D classification datasets validate the superiority of TopGCN.

引用

页码：324 / 337

页数：14

共 60 条

[1] Ahmed E., 2018, arXiv
[2] Expert Gate: Lifelong Learning with a Network of Experts
Aljundi, Rahaf
Chakravarty, Punarjay
Tuytelaars, Tinne
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7120 - 7129
[3] [Anonymous], 2015, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2015.7298801
[4] End-to-End Incremental Learning
Castro, Francisco M.
Marin-Jimenez, Manuel J.
Guil, Nicolas
Schmid, Cordelia
Alahari, Karteek
[J]. COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 241 - 257
[5] Chang Angel X., 2015, arXiv
[6] Riemannian Walk for Incremental Learning: Understanding Forgetting and Intransigence
Chaudhry, Arslan
Dokania, Puneet K.
Ajanthan, Thalaiyasingam
Torr, Philip H. S.
[J]. COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 : 556 - 572
[7] Deep Meta Metric Learning
Chen, Guangyi
Zhang, Tianren
Lu, Jiwen
Zhou, Jie
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9546 - 9555
[8] Chen K., 2021, P ICLR
[9] Semantic-aware Knowledge Distillation for Few-Shot Class-Incremental Learning
Cheraghian, Ali
Rahman, Shafin
Fang, Pengfei
Roy, Soumava Kumar
Petersson, Lars
Harandi, Mehrtash
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2534 - 2543
[10] A Comprehensive Study of 3-D Vision-Based Robot Manipulation
Cong, Yang
Chen, Ronghan
Ma, Bingtao
Liu, Hongsen
Hou, Dongdong
Yang, Chenguang
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1682 - 1698

← 1 2 3 4 5 6 →