Improving Classification Performance of Softmax Loss Function Based on Scalable Batch-Normalization

被引：43

作者：

Zhu, Qiuyu ^{[1
]}

He, Zikuang ^{[1
]}

Zhang, Tao ^{[2
]}

Cui, Wennan ^{[2
]}

机构：

[1] Shanghai Univ, Sch Commun & Informat Engn, Shanghai 200444, Peoples R China

[2] Chinese Acad Sci, Key Lab Intelligent Infrared Percept, Shanghai 200083, Peoples R China

来源：

APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 08期

关键词：

convolutional neural network; loss function; gradient decent;

D O I：

10.3390/app10082950

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Featured Application This work can be widely used in all kinds of pattern recognition systems based on deep learning, such as face recognition, license plate recognition, and speech recognition, etc. Abstract Convolutional neural networks (CNNs) have made great achievements on computer vision tasks, especially the image classification. With the improvement of network structure and loss functions, the performance of image classification is getting higher and higher. The classic Softmax + cross-entropy loss has been the norm for training neural networks for years, which is calculated from the output probability of the ground-truth class. Then the network's weight is updated by gradient calculation of the loss. However, after several epochs of training, the back-propagation errors usually become almost negligible. For the above considerations, we proposed that batch normalization with adjustable scale could be added after network output to alleviate the problem of vanishing gradient problem in deep learning. The experimental results show that our method can significantly improve the final classification accuracy on different network structures, and is also better than many other improved classification Loss.

引用

页数：8

共 14 条

[1]

[Anonymous], 2015, ARXIV PREPRINT ARXIV

[2]

[Anonymous], 2017, ADV SOC SCI EDUC HUM

[3]

[Anonymous], TR2009 U TOR

[4]

[Anonymous], EFFICIENT BACKPROP N

[5]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[6] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[7] Densely Connected Convolutional Networks [J].

Huang, Gao ;

Liu, Zhuang ;

van der Maaten, Laurens ;

Weinberger, Kilian Q. .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269

[8]

Liu W., 2016, ICML, V2, P7

[9]

Simonyan K., 2014, 14091556 ARXIV

[10] NormFace: L2 Hypersphere Embedding for Face Verification [J].

Wang, Feng ;

Xiang, Xiang ;

Cheng, Jian ;

Yuille, Alan L. .

PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, :1041-1049

← 1 2 →