Circle Loss: A Unified Perspective of Pair Similarity Optimization

被引:700
作者
Sun, Yifan [1 ]
Cheng, Changmao [1 ]
Zhang, Yuhan [2 ]
Zhang, Chi [1 ]
Zheng, Liang [3 ]
Wang, Zhongdao [4 ]
Wei, Yichen [1 ]
机构
[1] MEGVII Technol, Beijing, Peoples R China
[2] Beihang Univ, Beijing, Peoples R China
[3] Australian Natl Univ, Canberra, ACT, Australia
[4] Tsinghua Univ, Beijing, Peoples R China
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2020年
关键词
D O I
10.1109/CVPR42600.2020.00643
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper provides a pair similarity optimization viewpoint on deep feature learning, aiming to maximize the within-class similarity s(p) and minimize the between-class similarity s(n). We find a majority of loss functions, including the triplet loss and the softmax cross-entropy loss, embed sp and s p into similarity pairs and seek to reduce (s(n) - s(p)). Such an optimization manner is inflexible, because the penalty strength on every single similarity score is restricted to be equal. Our intuition is that if a similarity score deviates far from the optimum, it should be emphasized. To this end, we simply re-weight each similarity to highlight the less-optimized similarity scores. It results in a Circle loss, which is named due to its circular decision boundary. The Circle loss has a unified formula for two elemental deep feature learning paradigms, i.e., learning with class-level labels and pair-wise labels. Analytically, we show that the Circle loss offers a more flexible optimization approach towards a more definite convergence target, compared with the loss functions optimizing (s(n) - s(p)). Experimentally, we demonstrate the superiority of the Circle loss on a variety of deep feature learning tasks. On face recognition, person re-identification, as well as several fine-grained image retrieval datasets, the achieved performance is on par with the state of the art.
引用
收藏
页码:6397 / 6406
页数:10
相关论文
共 42 条
[1]  
[Anonymous], 2017, L2-constrained softmax loss for discriminative face verification
[2]   Learning a similarity metric discriminatively, with application to face verification [J].
Chopra, S ;
Hadsell, R ;
LeCun, Y .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :539-546
[3]   ArcFace: Additive Angular Margin Loss for Deep Face Recognition [J].
Deng, Jiankang ;
Guo, Jia ;
Xue, Niannan ;
Zafeiriou, Stefanos .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4685-4694
[4]   A new Bergeria (Flemingitaceae) from the Mississippian of Xinjiang, NW China and its evolutionary implications [J].
Feng, Ru ;
D'Rozario, Ashalata ;
Zhang, Jian-Wei .
JOURNAL OF PALAEOGEOGRAPHY-ENGLISH, 2019, 8
[5]  
Ge W., 2018, EUR C COMP VIS SEPT
[6]  
Hadsell R., 2006, 2006 IEEE COMP SOC C, V2, P1735, DOI DOI 10.1109/CVPR.2006.100
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]  
He L., 2019, CORR
[9]   Deep Metric Learning Using Triplet Network [J].
Hoffer, Elad ;
Ailon, Nir .
SIMILARITY-BASED PATTERN RECOGNITION, SIMBAD 2015, 2015, 9370 :84-92
[10]  
Huang Gary B., 2008, WORKSHOP FACESREAL L