G-Softmax: Improving Intraclass Compactness and Interclass Separability of Features

Cited by: 39
Authors
Luo, Yan [1]
Wong, Yongkang [2]
Kankanhalli, Mohan [2]
Zhao, Qi [1]
Affiliations
[1] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA
[2] Natl Univ Singapore, Sch Comp, Singapore 117417, Singapore
Funding
National Science Foundation (USA); National Research Foundation, Singapore
Keywords
Compactness and separability; deep learning; Gaussian-based softmax; multilabel classification; DEEP; CLASSIFICATION; RECOGNITION; NETWORKS;
DOI
10.1109/TNNLS.2019.2909737
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Intraclass compactness and interclass separability are crucial indicators to measure the effectiveness of a model to produce discriminative features, where intraclass compactness indicates how close the features with the same label are to each other and interclass separability indicates how far away the features with different labels are. In this paper, we investigate intraclass compactness and interclass separability of features learned by convolutional networks and propose a Gaussian-based softmax (G-softmax) function that can effectively improve intraclass compactness and interclass separability. The proposed function is simple to implement and can easily replace the softmax function. We evaluate the proposed G-softmax function on classification data sets (i.e., CIFAR-10, CIFAR-100, and Tiny ImageNet) and on multilabel classification data sets (i.e., MS COCO and NUS-WIDE). The experimental results show that the proposed G-softmax function improves the state-of-the-art models across all evaluated data sets. In addition, the analysis of the intraclass compactness and interclass separability demonstrates the advantages of the proposed function over the softmax function, which is consistent with the performance improvement. More importantly, we observe that high intraclass compactness and interclass separability are linearly correlated with average precision on MS COCO and NUS-WIDE. This implies that the improvement of intraclass compactness and interclass separability would lead to the improvement of average precision.
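To make the two quantities named in the abstract concrete, here is a minimal Python sketch. It assumes a simple centroid-based reading: spread is the mean Euclidean distance from each feature to its class centroid, and separation is the mean distance between class centroids. These definitions, and the function names `class_centroids`, `intraclass_spread`, and `interclass_distance`, are illustrative assumptions. They are not the measures defined in the paper, and the sketch does not implement G-softmax itself.

```python
import math
from collections import defaultdict


def class_centroids(features, labels):
    """Return {label: centroid}, where a centroid is the mean feature vector."""
    groups = defaultdict(list)
    for vec, lab in zip(features, labels):
        groups[lab].append(vec)
    return {
        lab: [sum(col) / len(vecs) for col in zip(*vecs)]
        for lab, vecs in groups.items()
    }


def intraclass_spread(features, labels):
    """Mean Euclidean distance from each feature to its own class centroid.

    Lower values mean tighter clusters, i.e. higher compactness
    (assumed proxy, not the paper's definition).
    """
    cents = class_centroids(features, labels)
    dists = [math.dist(vec, cents[lab]) for vec, lab in zip(features, labels)]
    return sum(dists) / len(dists)


def interclass_distance(features, labels):
    """Mean Euclidean distance between all pairs of class centroids.

    Higher values mean better-separated classes (assumed proxy).
    Requires at least two distinct labels.
    """
    cents = list(class_centroids(features, labels).values())
    pairs = [
        math.dist(cents[i], cents[j])
        for i in range(len(cents))
        for j in range(i + 1, len(cents))
    ]
    return sum(pairs) / len(pairs)


# Toy 2-D features: two classes, each with two points.
features = [[0.0, 0.0], [0.0, 2.0], [4.0, 0.0], [4.0, 2.0]]
labels = [0, 0, 1, 1]
spread = intraclass_spread(features, labels)
separation = interclass_distance(features, labels)
```

Under these assumed definitions, a feature extractor improves on both criteria when `spread` falls and `separation` rises on the same data.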
Pages: 685-699
Page count: 15