Large-Margin Regularized Softmax Cross-Entropy Loss

被引:32
作者
Li, Xiaoxu [1 ]
Chang, Dongliang [1 ]
Tian, Tao [1 ]
Cao, Jie [1 ]
机构
[1] Lanzhou Univ Technol, Sch Comp & Commun, Lanzhou 730050, Gansu, Peoples R China
基金
中国国家自然科学基金;
关键词
Neural networks; cross-entropy loss; large-margin regularization; NEURAL-NETWORKS; DEEP;
D O I
10.1109/ACCESS.2019.2897692
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Softmax cross-entropy loss with L2 regularization is commonly adopted in the machine learning and neural network community. Considering that the traditional softmax cross-entropy loss simply focuses on fitting or classifying the training data accurately but does not explicitly encourage a large decision margin for classification, some loss functions are proposed to improve the generalization performance by solving the problem. However, these loss functions enhance the difficulty of model optimization. In addition, inspired by regularized logistic regression, where the regularized term is responsible for adjusting the width of decision margin, which can be seen as an approximation of support vector machine, we proposed a large-margin regularization method for softmax cross-entropy loss. The advantages of the proposed loss are twofold as follows: the first is the generalization performance improvement, and the second is easy optimization. The experimental results on three small-sample datasets show that our regularization method achieves good performance and outperforms the existing popular regularization methods of neural networks.
引用
收藏
页码:19572 / 19578
页数:7
相关论文
共 44 条
[1]  
Akaike H., 1998, 2 INT S INF THEOR, P199, DOI 10.1007/978-1-4612-1694-015
[2]  
[Anonymous], 2017, LEARNING DEEP FEATUR
[3]   Person Re-Identification by Camera Correlation Aware Feature Augmentation [J].
Chen, Ying-Cong ;
Zhu, Xiatian ;
Zheng, Wei-Shi ;
Lai, Jian-Huang .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (02) :392-408
[4]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[5]   Multi-Instance Multi-Label Learning Combining Hierarchical Context and its Application to Image Annotation [J].
Ding, Xinmiao ;
Li, Bing ;
Xiong, Weihua ;
Guo, Wen ;
Hu, Weiming ;
Wang, Bo .
IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (08) :1616-1627
[6]   Representing and Retrieving Video Shots in Human-Centric Brain Imaging Space [J].
Han, Junwei ;
Ji, Xiang ;
Hu, Xintao ;
Zhu, Dajiang ;
Li, Kaiming ;
Jiang, Xi ;
Cui, Guangbin ;
Guo, Lei ;
Liu, Tianming .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (07) :2723-2736
[7]   Support vector machines [J].
Hearst, MA .
IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1998, 13 (04) :18-21
[8]  
Jaderberg M., 2015, Neural Inf. Process. Syst., V28, P2017, DOI DOI 10.48550/ARXIV.1506.02025
[9]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[10]  
Lazebnik S., 2006, 2006 IEEE COMP SOC C, VVolume 2, P2169, DOI DOI 10.1109/CVPR.2006.68