Dual Part Discovery Network for Zero-Shot Learning

被引：12

作者：

Ge, Jiannan ^{[1
]}

Xie, Hongtao ^{[1
]}

Min, Shaobo ^{[2
]}

Li, Pandeng ^{[1
]}

Zhang, Yongdong ^{[1
]}

机构：

[1] Univ Sci & Technol China, Hefei, Peoples R China

[2] Tencent Data Platform, Shenzhen, Peoples R China

来源：

PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022 | 2022年

关键词：

Zero-shot learning; object recognition; joint embedding;

D O I：

10.1145/3503161.3547889

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Zero-Shot Learning (ZSL) aims to recognize unseen classes by transferring knowledge from seen classes. Recent methods focus on learning a common semantic space to align visual and attribute information. However, they always over-relied on provided attributes and ignored the category discriminative information that contributes to accurate unseen class recognition, resulting in weak transferability. To this end, we propose a novel Dual Part Discovery Network (DPDN) that considers both attribute and category discriminative information by discovering attribute-guided parts and category-guided parts simultaneously to improve knowledge transfer. Specifically, for attribute-guided parts discovery, DPDN can localize the regions with specific attribute information and significantly bridge the gap between visual and semantic information guided by the given attributes. For category-guided parts discovery, the local parts are explored to discover other important regions that bring latent crucial details ignored by attributes, with the guidance of adaptive category prototypes. To better mine the transferable knowledge, we impose class correlations constraints to regularize the category prototypes. Finally, attribute- and category-guided parts complement each other and provide adequate discriminative subtle information for more accurate unseen class recognition. Extensive experimental results demonstrate that DPDN can discover discriminative parts and outperform state-of-the-art methods on three standard benchmarks.

引用

页码：3244 / 3252

页数：9

共 53 条

[1] Label-Embedding for Attribute-Based Classification [J].

Akata, Zeynep ;

Perronnin, Florent ;

Harchaoui, Zaid ;

Schmid, Cordelia .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :819-826

[2]

[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00961

[3]

Carion N., 2020, EUROPEAN C COMPUTER, V12346, P213, DOI 10.1007/978-3-030-58452-8_13

[4] An Empirical Study and Analysis of Generalized Zero-Shot Learning for Object Recognition in the Wild [J].

Chao, Wei-Lun ;

Changpinyo, Soravit ;

Gong, Boqing ;

Sha, Fei .

COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :52-68

[5]

Chen S., 2022, P AAAI

[6] FREE: Feature Refinement for Generalized Zero-Shot Learning [J].

Chen, Shiming ;

Wang, Wenjie ;

Xia, Beihao ;

Peng, Qinmu ;

You, Xinge ;

Zheng, Feng ;

Shao, Ling .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :122-131

[7]

Chen Shiming, 2022, IEEE CVF C COMP VIS

[8]

Chen Xinlei, 2020, Autophagy, DOI DOI 10.1080/15548627.2020.1810918

[9]

Chen Z., 2021, ICCV

[10] Rethinking Generative Zero-Shot Learning: An Ensemble Learning Perspective for Recognising Visual Patches [J].

Chen, Zhi ;

Wang, Sen ;

Li, Jingjing ;

Huang, Zi .

MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, :3413-3421

← 1 2 3 4 5 6 →