Boosting Contrastive Learning with Relation Knowledge Distillation

Times Cited: 0
Authors
Zheng, Kai [1 ]
Wang, Yuanjiang [1 ]
Yuan, Ye [1 ]
Affiliations
[1] Megvii Technol, Beijing, Peoples R China
Source
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2022
Keywords
DOI
Not available
CLC Classification
TP18 [Artificial Intelligence Theory];
Subject Classification Codes
081104; 0812; 0835; 1405
Abstract
While self-supervised representation learning (SSL) has proven effective for large models, a large gap remains between SSL and supervised methods on lightweight models when the same solution is applied. We delve into this problem and find that lightweight models are prone to collapse in semantic space when simply performing instance-wise contrast. To address this issue, we propose a relation-wise contrastive paradigm with Relation Knowledge Distillation (ReKD). We introduce a heterogeneous teacher that explicitly mines semantic information and transfers a novel relation knowledge to the student (lightweight model). Theoretical analysis supports our main concern about instance-wise contrast and verifies the effectiveness of our relation-wise contrastive learning. Extensive experimental results also demonstrate that our method achieves significant improvements on multiple lightweight models. In particular, linear evaluation on AlexNet improves the current state of the art from 44.7% to 50.1%, making this the first work to approach the supervised result (50.5%). Code will be made available.
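The abstract describes distilling relation knowledge from a heterogeneous teacher: rather than matching individual embeddings, the student is trained to reproduce the teacher's pairwise relations within a batch. A minimal NumPy sketch of one plausible objective in that spirit follows; the function name, temperature values, and the KL-divergence form are illustrative assumptions, not ReKD's exact loss. Note the teacher and student embedding dimensions may differ, since relations are computed within each model's own space.

```python
import numpy as np

def relation_kd_loss(student_emb, teacher_emb, tau_s=0.1, tau_t=0.05):
    """Hypothetical relation-distillation loss: KL divergence between the
    teacher's and student's within-batch similarity distributions."""
    def normalize(x):
        return x / np.linalg.norm(x, axis=1, keepdims=True)

    def softmax(z):
        z = z - z.max(axis=1, keepdims=True)
        e = np.exp(z)
        return e / e.sum(axis=1, keepdims=True)

    s, t = normalize(student_emb), normalize(teacher_emb)
    # Pairwise cosine similarities within the batch; mask out self-similarity.
    logits_s, logits_t = s @ s.T, t @ t.T
    np.fill_diagonal(logits_s, -np.inf)
    np.fill_diagonal(logits_t, -np.inf)
    p_t = softmax(logits_t / tau_t)            # soft relation targets
    log_p_s = np.log(softmax(logits_s / tau_s) + 1e-12)
    # KL(p_t || p_s), averaged over the batch.
    kl = np.where(p_t > 0, p_t * (np.log(p_t + 1e-12) - log_p_s), 0.0)
    return kl.sum() / len(s)

rng = np.random.default_rng(0)
loss = relation_kd_loss(rng.normal(size=(8, 64)), rng.normal(size=(8, 128)))
```

A sharper teacher temperature (`tau_t < tau_s`) is a common distillation choice that peaks the target distribution; whether ReKD uses this is an assumption here.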
Pages: 3508-3516
Page count: 9