CoDi: Contrastive Disentanglement Generative Adversarial Networks for Zero-Shot Sketch-Based 3D Shape Retrieval

被引：0

作者：

Meng, Min ^{[1
]}

Chen, Wenhang ^{[1
]}

Liu, Jigang ^{[2
]}

Yu, Jun ^{[3
,4
]}

Wu, Jigang ^{[1
]}

机构：

[1] Guangdong Univ Technol, Sch Comp Sci, Guangzhou 510006, Peoples R China

[2] Ping An Life Insurance China, Shenzhen 518046, Peoples R China

[3] Harbin Inst Technol, Dept Comp Sci & Technol, Shenzhen 518055, Peoples R China

[4] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2025年 / 35卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Shape; Three-dimensional displays; Semantics; Circuits and systems; Prototypes; Contrastive learning; Zero shot learning; Feature extraction; Training; Computational modeling; Sketch-based shape retrieval; zero-shot learning; disentanglement; contrastive learning; REPRESENTATION;

D O I：

10.1109/TCSVT.2024.3472036

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Sketch-based 3D shape retrieval has attracted increasing attention in recent years. Most existing methods fail to address the zero-shot scenario, and the few dedicated to zero-shot learning encounter the following two issues: 1) the features learned by these methods lack informativeness and generalization, rendering them ineffective in identifying unseen samples; 2) the generation of low-quality samples, aimed at facilitating the recognition of unseen categories, paradoxically diminishes their ability to identify these unseen classes. This paper introduces a novel contrastive disentanglement generative adversarial networks (CoDi) tailored for zero-shot sketch-based 3D shape retrieval. Initially, we introduce a paradoxical feature construction approach designed to assist the networks in capturing certain low-level features. Despite their weak semantic relevance, these features play a crucial role in sample recognition. Subsequently, a SemContrast fusion module is employed to align the semantic space with the prototype embedding space of categories. This alignment facilitates knowledge transfer to unseen classes and promotes the generation of high-quality samples. The networks are jointly trained on real and generated samples to achieve retrieval for unseen categories. Extensive experiments demonstrate a significant improvement in retrieval performance for unseen categories using our method.

引用

页码：1910 / 1920

页数：11

共 49 条

[1] Representation Learning: A Review and New Perspectives [J].

Bengio, Yoshua ;

Courville, Aaron ;

Vincent, Pascal .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828

[2]

Chang Jia-Ren, 2021, P IEEE CVF INT C COM, P9680

[3] DisenDreamer: Subject-Driven Text-to-Image Generation With Sample-Aware Disentangled Tuning [J].

Chen, Hong ;

Zhang, Yipeng ;

Wang, Xin ;

Duan, Xuguang ;

Zhou, Yuwei ;

Zhu, Wenwu .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) :6860-6873

[4] Deep Cross-Modality Adaptation via Semantics Preserving Adversarial Learning for Sketch-Based 3D Shape Retrieval [J].

Chen, Jiaxin ;

Fang, Yi .

COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 :624-640

[5]

Chen T, 2020, PR MACH LEARN RES, V119

[6] Deep Correlated Holistic Metric Learning for Sketch-Based 3D Shape Retrieval [J].

Dai, Guoxian ;

Xie, Jin ;

Fang, Yi .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (07) :3374-3386

[7] CROSS-MODAL GUIDANCE NETWORK FOR SKETCH-BASED 3D SHAPE RETRIEVAL [J].

Dai, Weidong ;

Liang, Shuang .

2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,

[8] Progressive Cross-Modal Semantic Network for Zero-Shot Sketch-Based Image Retrieval [J].

Deng, Cheng ;

Xu, Xinxun ;

Wang, Hao ;

Yang, Muli ;

Tao, Dacheng .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :8892-8902

[9] Systematic review with meta-analysis: the critical role of dermatological events in patients with hepatocellular carcinoma treated with sorafenib [J].

Diaz-Gonzalez, Alvaro ;

Sanduzzi-Zamparelli, Marco ;

Sapena, Victor ;

Torres, Ferran ;

LLarch, Neus ;

Iserte, Gemma ;

Forner, Alejandro ;

da Fonseca, Leonardo ;

Rios, Jose ;

Bruix, Jordi ;

Reig, Maria .

ALIMENTARY PHARMACOLOGY & THERAPEUTICS, 2019, 49 (05) :482-491

[10]

Dong SY, 2024, Arxiv, DOI arXiv:2401.16459

← 1 2 3 4 5 →