Kernel pooling feature representation of pre-trained convolutional neural networks for leaf recognition

被引：3

作者：

Feng, Shu ^{[1
]}

机构：

[1] Shanxi Agr Univ, Dept Fdn, Taigu 030801, Shanxi, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2022年 / 81卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Leaf recognition; Feature representation; Convolutional neural networks; Kernel pooling; Second order information; NONRIGID SHAPES; PLANT; CLASSIFICATION; DESCRIPTOR; PROJECTION; RETRIEVAL; ROTATION; DISTANCE; IMAGE;

D O I：

10.1007/s11042-021-11769-0

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Due to the presence of various types of factors, such as illumination, viewpoint, intra-class complexity, and inter-class similarity, which make plant leaf recognition still a challenging research problem. In this paper, we present a very simple, yet effective feature representation method for plant leaf recognition. Concretely, it comprises four stages. Firstly, each leaf image is fed into an imagenet pre-trained CNN model to extract activated feature maps in a specified layer. Secondly, inspired by 1 x1 convolution, we exploit principle component analysis to learn the 1 x1 convolution filters. As a result, it not only eliminates the redundant information, reduces the feature dimension adaptively that is beneficial to the subsequent high order pooling, but also increases classification accuracy. Thirdly, kernel pooling is employed to capture second order statistics between each pair of features with the purpose of learning more discriminative information. Finally, matrix sqrt and upper triangle are performed to obtain the final leaf representation, which is utilized for classification and retrieval by the euclidean distance based nearest neighbor classifier. Extensive experiments are conducted on four representative plant leaf datasets, Flavia, Swedish, MEW2012, ICL, to validate the effectiveness of our method. For classification task, our method achieves outstanding and better average classification accuracies than the comparative state-of-theart baselines. For retrieval task, our method gets significant higher or competitive MAP scores. Our implementation code will be available at https://github.com/fengshu666666/leafrecognition.

引用

页码：4255 / 4282

页数：28

共 72 条

[1] A multiscale representation method for nonrigid shapes with a single closed contour [J].

Adamek, T ;

O'Connor, NE .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2004, 14 (05) :742-753

[2] Shape retrieval using triangle-area representation and dynamic space warping [J].

Alajlan, Naif ;

El Rube, Ibrahim ;

Kamel, Mohamed S. ;

Freeman, George .

PATTERN RECOGNITION, 2007, 40 (07) :1911-1920

[3]

[Anonymous], 2015, ICLR

[4]

Araujo VM, 2020, 2 VIEW FINE GRAINED

[5] Co-Transduction for Shape Retrieval [J].

Bai, Xiang ;

Wang, Bo ;

Yao, Cong ;

Liu, Wenyu ;

Tu, Zhuowen .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (05) :2747-2757

[6] Shape matching and object recognition using shape contexts [J].

Belongie, S ;

Malik, J ;

Puzicha, J .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) :509-522

[7] Free-Form Region Description with Second-Order Pooling [J].

Carreira, Joao ;

Caseiro, Rui ;

Batista, Jorge ;

Sminchisescu, Cristian .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (06) :1177-1189

[8] The devil is in the details: an evaluation of recent feature encoding methods [J].

Chatfield, Ken ;

Lempitsky, Victor ;

Vedaldi, Andrea ;

Zisserman, Andrew .

PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,

[9] Invariant leaf image recognition with histogram of Gaussian convolution vectors [J].

Chen, Xin ;

Wang, Bin .

COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 178

[10] Plant species classification using deep convolutional neural network [J].

Dyrmann, Mads ;

Karstoft, Henrik ;

Midtiby, Henrik Skov .

BIOSYSTEMS ENGINEERING, 2016, 151 :72-80

← 1 2 3 4 5 6 7 8 →