Detachable Second-Order Pooling: Toward High-Performance First-Order Networks

被引:2
作者
Li, Lida [1 ]
Xie, Jiangtao [2 ]
Li, Peihua [2 ]
Zhang, Lei [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
[2] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116024, Peoples R China
关键词
Training; Knowledge engineering; Task analysis; Covariance matrices; Correlation; Complexity theory; Visualization; First-order networks; image classification; second-order pooling;
D O I
10.1109/TNNLS.2021.3052829
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Second-order pooling has proved to be more effective than its first-order counterpart in visual classification tasks. However, second-order pooling suffers from the high demand for a computational resource, limiting its use in practical applications. In this work, we present a novel architecture, namely a detachable second-order pooling network, to leverage the advantage of second-order pooling by first-order networks while keeping the model complexity unchanged during inference. Specifically, we introduce second-order pooling at the end of a few auxiliary branches and plug them into different stages of a convolutional neural network. During the training stage, the auxiliary second-order pooling networks assist the backbone first-order network to learn more discriminative feature representations. When training is completed, all auxiliary branches can be removed, and only the backbone first-order network is used for inference. Experiments conducted on CIFAR-10, CIFAR-100, and ImageNet data sets clearly demonstrated the leading performance of our network, which achieves even higher accuracy than second-order networks but keeps the low inference complexity of first-order networks.
引用
收藏
页码:3400 / 3414
页数:15
相关论文
共 53 条
[1]  
[Anonymous], 2017, arXiv preprint arxiv 1712.07628
[2]   Geometric means in a novel vector space structure on symmetric positive-definite matrices [J].
Arsigny, Vincent ;
Fillard, Pierre ;
Pennec, Xavier ;
Ayache, Nicholas .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2007, 29 (01) :328-347
[3]   Higher-order Integration of Hierarchical Convolutional Activations for Fine-grained Visual Categorization [J].
Cai, Sijia ;
Zuo, Wangmeng ;
Zhang, Lei .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :511-520
[4]   Data Augmentation-Based Joint Learning for Heterogeneous Face Recognition [J].
Cao, Bing ;
Wang, Nannan ;
Li, Jie ;
Gao, Xinbo .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (06) :1731-1743
[5]   Person Re-Identification by Multi-Channel Parts-Based CNN with Improved Triplet Loss Function [J].
Cheng, De ;
Gong, Yihong ;
Zhou, Sanping ;
Wang, Jinjun ;
Zheng, Nanning .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1335-1344
[6]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7]   The PASCAL Visual Object Classes Challenge: A Retrospective [J].
Everingham, Mark ;
Eslami, S. M. Ali ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136
[8]  
Furlanello T, 2018, PR MACH LEARN RES, V80
[9]   Compact Bilinear Pooling [J].
Gao, Yang ;
Beijbom, Oscar ;
Zhang, Ning ;
Darrell, Trevor .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :317-326
[10]   Cross Modal Distillation for Supervision Transfer [J].
Gupta, Saurabh ;
Hoffman, Judy ;
Malik, Jitendra .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2827-2836