Detachable Second-Order Pooling: Toward High-Performance First-Order Networks

被引：2

作者：

Li, Lida ^{[1
]}

Xie, Jiangtao ^{[2
]}

Li, Peihua ^{[2
]}

Zhang, Lei ^{[1
]}

机构：

[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China

[2] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116024, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2022年 / 33卷 / 08期

关键词：

Training; Knowledge engineering; Task analysis; Covariance matrices; Correlation; Complexity theory; Visualization; First-order networks; image classification; second-order pooling;

D O I：

10.1109/TNNLS.2021.3052829

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Second-order pooling has proved to be more effective than its first-order counterpart in visual classification tasks. However, second-order pooling suffers from the high demand for a computational resource, limiting its use in practical applications. In this work, we present a novel architecture, namely a detachable second-order pooling network, to leverage the advantage of second-order pooling by first-order networks while keeping the model complexity unchanged during inference. Specifically, we introduce second-order pooling at the end of a few auxiliary branches and plug them into different stages of a convolutional neural network. During the training stage, the auxiliary second-order pooling networks assist the backbone first-order network to learn more discriminative feature representations. When training is completed, all auxiliary branches can be removed, and only the backbone first-order network is used for inference. Experiments conducted on CIFAR-10, CIFAR-100, and ImageNet data sets clearly demonstrated the leading performance of our network, which achieves even higher accuracy than second-order networks but keeps the low inference complexity of first-order networks.

引用

页码：3400 / 3414

页数：15

共 53 条

[1]

[Anonymous], 2017, arXiv preprint arxiv 1712.07628

[2] Geometric means in a novel vector space structure on symmetric positive-definite matrices [J].

Arsigny, Vincent ;

Fillard, Pierre ;

Pennec, Xavier ;

Ayache, Nicholas .

SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2007, 29 (01) :328-347

[3] Higher-order Integration of Hierarchical Convolutional Activations for Fine-grained Visual Categorization [J].

Cai, Sijia ;

Zuo, Wangmeng ;

Zhang, Lei .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :511-520

[4] Data Augmentation-Based Joint Learning for Heterogeneous Face Recognition [J].

Cao, Bing ;

Wang, Nannan ;

Li, Jie ;

Gao, Xinbo .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (06) :1731-1743

[5] Person Re-Identification by Multi-Channel Parts-Based CNN with Improved Triplet Loss Function [J].

Cheng, De ;

Gong, Yihong ;

Zhou, Sanping ;

Wang, Jinjun ;

Zheng, Nanning .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1335-1344

[6]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[7] The PASCAL Visual Object Classes Challenge: A Retrospective [J].

Everingham, Mark ;

Eslami, S. M. Ali ;

Van Gool, Luc ;

Williams, Christopher K. I. ;

Winn, John ;

Zisserman, Andrew .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136

[8]

Furlanello T, 2018, PR MACH LEARN RES, V80

[9] Compact Bilinear Pooling [J].

Gao, Yang ;

Beijbom, Oscar ;

Zhang, Ning ;

Darrell, Trevor .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :317-326

[10] Cross Modal Distillation for Supervision Transfer [J].

Gupta, Saurabh ;

Hoffman, Judy ;

Malik, Jitendra .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2827-2836

← 1 2 3 4 5 6 →