Circle Loss: A Unified Perspective of Pair Similarity Optimization

Cited by: 700

Authors:

Sun, Yifan [1]

Cheng, Changmao [1]

Zhang, Yuhan [2]

Zhang, Chi [1]

Zheng, Liang [3]

Wang, Zhongdao [4]

Wei, Yichen [1]

Affiliations:

[1] MEGVII Technol, Beijing, Peoples R China

[2] Beihang Univ, Beijing, Peoples R China

[3] Australian Natl Univ, Canberra, ACT, Australia

[4] Tsinghua Univ, Beijing, Peoples R China

Source:

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2020

Keywords:

DOI:

10.1109/CVPR42600.2020.00643

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper provides a pair similarity optimization viewpoint on deep feature learning, aiming to maximize the within-class similarity s_p and minimize the between-class similarity s_n. We find that a majority of loss functions, including the triplet loss and the softmax cross-entropy loss, embed s_n and s_p into similarity pairs and seek to reduce (s_n - s_p). Such an optimization manner is inflexible, because the penalty strength on every single similarity score is restricted to be equal. Our intuition is that if a similarity score deviates far from the optimum, it should be emphasized. To this end, we simply re-weight each similarity to highlight the less-optimized similarity scores. This results in the Circle loss, which is named for its circular decision boundary. The Circle loss has a unified formula for two elemental deep feature learning paradigms, i.e., learning with class-level labels and learning with pair-wise labels. Analytically, we show that the Circle loss offers a more flexible optimization approach towards a more definite convergence target, compared with the loss functions optimizing (s_n - s_p). Experimentally, we demonstrate the superiority of the Circle loss on a variety of deep feature learning tasks. On face recognition, person re-identification, as well as several fine-grained image retrieval datasets, the achieved performance is on par with the state of the art.
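The re-weighting idea in the abstract can be illustrated with a small sketch. The abstract itself does not state the formula; the form used here, log(1 + sum_j exp(gamma * a_n * (s_n - D_n)) * sum_i exp(-gamma * a_p * (s_p - D_p))) with a_p = max(O_p - s_p, 0), a_n = max(s_n - O_n, 0), O_p = 1 + m, O_n = -m, D_p = 1 - m, D_n = m, is the commonly cited one from the full paper. The function name `circle_loss`, the helper names, and the default values of `m` and `gamma` are assumptions chosen for illustration, not the authors' code.

```python
import math


def _logsumexp(xs):
    """Numerically stable log(sum(exp(x))) over a non-empty list."""
    peak = max(xs)
    return peak + math.log(sum(math.exp(x - peak) for x in xs))


def _softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def circle_loss(sp, sn, m=0.25, gamma=256.0):
    """Illustrative Circle loss for one anchor.

    sp: within-class similarity scores (assumed cosine similarities).
    sn: between-class similarity scores.
    m, gamma: margin and scale; the defaults are illustrative assumptions.
    """
    if not sp or not sn:
        raise ValueError("need at least one positive and one negative score")
    o_p, o_n = 1.0 + m, -m      # optima for s_p and s_n
    d_p, d_n = 1.0 - m, m       # margins for s_p and s_n
    # Each score gets its own weight, larger the further it is from its optimum.
    logits_p = [-gamma * max(o_p - s, 0.0) * (s - d_p) for s in sp]
    logits_n = [gamma * max(s - o_n, 0.0) * (s - d_n) for s in sn]
    # log(1 + sum_n exp(.) * sum_p exp(.)) written as softplus of two logsumexps.
    return _softplus(_logsumexp(logits_p) + _logsumexp(logits_n))
```

In this sketch a well-separated pair such as `circle_loss([0.9], [0.1])` yields a loss close to zero, while a poorly separated pair such as `circle_loss([0.3], [0.7])` is penalized heavily, because both scores sit far from their optima and therefore receive large weights.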

Citation

Pages: 6397-6406

Page count: 10

