Soft-Margin Softmax for Deep Classification

Cited by: 60

Authors:

Liang, Xuezhi [1, 2, 3, 4]

Wang, Xiaobo [1, 2, 4]

Lei, Zhen [1, 2, 4]

Liao, Shengcai [1, 2, 4]

Li, Stan Z. [1, 2, 3, 4]

Affiliations:

[1] Chinese Acad Sci, Ctr Biometr & Secur Res, Beijing, Peoples R China

[2] Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Beijing, Peoples R China

[3] Chinese Acad Sci, Ctr Internet Things, Wuxi, Jiangsu, Peoples R China

[4] Univ Chinese Acad Sci, Beijing, Peoples R China

Source:

NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II | 2017 / Vol. 10635

Keywords:

CNN; Softmax; L-Softmax; SM-Softmax; Classification;

DOI:

10.1007/978-3-319-70096-0_43

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In deep classification, the softmax loss (Softmax) is arguably one of the most commonly used components for training deep convolutional neural networks (CNNs). However, this widely used loss is limited because it does not explicitly encourage discriminative features. Recently, the large-margin softmax loss (L-Softmax [1]) was proposed to explicitly enhance feature discrimination, but it relies on a hard margin and requires complex forward and backward computation. In this paper, we propose a novel soft-margin softmax (SM-Softmax) loss to improve the discriminative power of features. Specifically, SM-Softmax modifies only the forward pass of Softmax by introducing a non-negative real number m, without changing the backward pass. Thus it not only allows the desired continuous soft margin to be adjusted but can also be easily optimized by standard stochastic gradient descent (SGD). Experimental results on three benchmark datasets demonstrate the superiority of SM-Softmax over the baseline Softmax, the alternative L-Softmax, and several state-of-the-art competitors.
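The abstract does not state the loss formula, so the following is only an illustrative sketch of one plausible reading: the non-negative number m is subtracted from the target-class logit before the usual softmax cross-entropy is computed. That form, the function name `sm_softmax_loss`, and its arguments are assumptions made here for illustration, not details taken from this record.

```python
import math


def sm_softmax_loss(logits, target, m=0.0):
    """Softmax cross-entropy with a margin m on the target logit.

    ASSUMPTION: the margin enters as (target logit - m); the abstract
    only says that a non-negative real number m modifies the forward pass.
    """
    if m < 0:
        raise ValueError("m must be non-negative")
    # Reduce only the target-class logit by the margin (assumed form).
    shifted = [z - m if j == target else z for j, z in enumerate(logits)]
    # Log-sum-exp with the maximum subtracted for numerical stability.
    peak = max(shifted)
    log_sum = peak + math.log(sum(math.exp(z - peak) for z in shifted))
    # Negative log-probability of the target class.
    return log_sum - shifted[target]


if __name__ == "__main__":
    scores = [2.0, 1.0, 0.0]
    print(sm_softmax_loss(scores, target=0, m=0.0))  # plain softmax loss
    print(sm_softmax_loss(scores, target=0, m=1.0))  # larger loss with a margin
```

Under this assumed form, m = 0 gives the ordinary softmax loss, and a larger m asks the target score to exceed the others by a wider gap before the loss becomes small. Because m is a constant shift, the gradient with respect to the logits keeps the usual softmax-minus-one-hot form, evaluated on the shifted logits.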


Pages: 413-421

Page count: 9

References (28 in total):

[1] [Anonymous], 2015, AISTATS

[2] [Anonymous], 2014, ICLR

[3] [Anonymous], 2016, ICML

[4] [Anonymous], 2014, NIPS

[5] [Anonymous], 2013, arXiv

[6] [Anonymous], 2015, CVPR

[7] [Anonymous], 2011, AISTATS

[8] [Anonymous], 2013, ICML

[9] [Anonymous], 2014, JMLR

[10] [Anonymous], 2009, ICCV
