Knowledge distillation (KD) has become a widely used technique for model compression and knowledge transfer. We find that the standard KD method aligns knowledge for each individual sample only indirectly, via class prototypes, and neglects the structural knowledge between different samples, namely, knowledge correlation. Although recent contrastive learning-based distillation methods can be decomposed into knowledge alignment and correlation, their correlation objectives undesirably push apart representations of samples from the same class, leading to inferior distillation results. To improve distillation performance, we propose a novel knowledge correlation objective and introduce dual-level knowledge distillation (DLKD), which explicitly combines knowledge alignment and correlation instead of using a single contrastive objective. We show that both knowledge alignment and correlation are necessary to improve distillation performance. In particular, knowledge correlation can serve as an effective regularizer for learning generalized representations. The proposed DLKD is task-agnostic and model-agnostic, and enables effective knowledge transfer from supervised or self-supervised pretrained teachers to students. Experiments show that DLKD outperforms other state-of-the-art methods across a wide range of experimental settings, varying 1) pretraining strategies; 2) network architectures; 3) datasets; and 4) tasks.
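To make the distinction between the two levels concrete, the following is a minimal sketch, not the DLKD objective itself, which the abstract does not specify. It assumes student and teacher features have the same dimension, uses a per-sample squared distance as a stand-in for knowledge alignment, and uses a match between pairwise cosine-similarity matrices as a stand-in for knowledge correlation. The function names and the `weight` parameter are hypothetical.

```python
import math

def l2_normalize(v):
    # Scale a feature vector to unit length (zero vectors are left unchanged).
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]

def alignment_loss(student, teacher):
    # Sample level: squared distance between the student and teacher
    # features of the same sample, averaged over the batch.
    total = 0.0
    for s, t in zip(student, teacher):
        s, t = l2_normalize(s), l2_normalize(t)
        total += sum((a - b) ** 2 for a, b in zip(s, t))
    return total / len(student)

def similarity_matrix(feats):
    # Pairwise cosine similarities between all samples in the batch.
    feats = [l2_normalize(f) for f in feats]
    return [[sum(a * b for a, b in zip(fi, fj)) for fj in feats] for fi in feats]

def correlation_loss(student, teacher):
    # Batch level: mean squared difference between the student's and the
    # teacher's pairwise similarity matrices (an assumed stand-in, not the
    # correlation objective proposed in the paper).
    ss, st = similarity_matrix(student), similarity_matrix(teacher)
    n = len(student)
    return sum((ss[i][j] - st[i][j]) ** 2
               for i in range(n) for j in range(n)) / (n * n)

def dual_level_loss(student, teacher, weight=1.0):
    # Explicit sum of the two terms; `weight` is a hypothetical trade-off.
    return alignment_loss(student, teacher) + weight * correlation_loss(student, teacher)

# A student whose features are a 90-degree rotation of the teacher's keeps
# the same between-sample structure but is misaligned sample by sample.
teacher_feats = [[1.0, 0.0], [0.0, 1.0]]
student_feats = [[0.0, 1.0], [-1.0, 0.0]]
print(alignment_loss(student_feats, teacher_feats))
print(correlation_loss(student_feats, teacher_feats))
```

In this toy case the correlation term is zero while the alignment term is not, which illustrates why the two terms carry different information and why combining them explicitly, as the abstract argues, is not redundant.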