Aggregating diverse deep attention networks for large-scale plant species identification

被引：11

作者：

Zhang, Haixi ^{[1
]}

Kuang, Zhenzhong ^{[2
]}

Peng, Xianlin ^{[1
]}

He, Guiqing ^{[1
]}

Peng, Jinye ^{[1
]}

Fan, Jianping ^{[3
]}

机构：

[1] Northwestern Polytech Univ, Xian, Shaanxi, Peoples R China

[2] Hangzhou Dianzi Univ, Hangzhou, Peoples R China

[3] Northwest Univ, Xian, Shaanxi, Peoples R China

来源：

NEUROCOMPUTING | 2020年 / 378卷

基金：

中国国家自然科学基金;

关键词：

Large-scale plant species identification; Plant taxonomy; Attention-based hierarchical multi-task learning; Fusion; CLASSIFICATION;

D O I：

10.1016/j.neucom.2019.10.077

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, a novel fusion method is proposed to deal with large-scale plant species identification by aggregating diverse outputs from multiple deep networks, where each deep network focus on one subset of the whole plant species. Firstly, a fixed plant taxonomy is constructed for organizing large number of fine-grained plant species hierarchically and it is further used as a guideline to help generating diverse but overlapped task groups. Secondly, an attention-based deep hierarchical multi-task learning (AHMTL) algorithm is proposed to recognize fine-grained plant species belonging to the same task group effectively by learning more discriminative deep features and classifiers jointly. Finally, we fuse all outputs from multiple deep networks to obtain the final high-level feature representation and give the prediction probability for each plant species. The experimental results have proved the effectiveness of our proposed method on large-scale plant species identification. (C) 2019 Elsevier B.V. All rights reserved.

引用

页码：283 / 294

页数：12

共 60 条

[31] ImageNet Classification with Deep Convolutional Neural Networks [J].

Krizhevsky, Alex ;

Sutskever, Ilya ;

Hinton, Geoffrey E. .

COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90

[32] Integrating multi-level deep learning and concept ontology for large-scale visual recognition [J].

Kuang, Zhenzhong ;

Yu, Jun ;

Li, Zongmin ;

Zhang, Baopeng ;

Fan, Jianping .

PATTERN RECOGNITION, 2018, 78 :198-214

[33]

Kumar N, 2012, LECT NOTES COMPUT SC, V7573, P502, DOI 10.1007/978-3-642-33709-3_36

[34] Gradient-based learning applied to document recognition [J].

Lecun, Y ;

Bottou, L ;

Bengio, Y ;

Haffner, P .

PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324

[35]

Lee CY, 2016, JMLR WORKSH CONF PRO, V51, P464

[36] Heterogeneous Multi-task Learning for Human Pose Estimation with Deep Convolutional Neural Network [J].

Li, Sijin ;

Liu, Zhi-Qiang ;

Chan, Antoni B. .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, :488-+

[37]

Lin Min, 2013, 13124400 ARXIV

[38] A survey of deep neural network architectures and their applications [J].

Liu, Weibo ;

Wang, Zidong ;

Liu, Xiaohui ;

Zeng, Nianyin ;

Liu, Yurong ;

Alsaadi, Fuad E. .

NEUROCOMPUTING, 2017, 234 :11-26

[39]

Long J, 2015, PROC CVPR IEEE, P3431, DOI 10.1109/CVPR.2015.7298965

[40]

Martínez-Muñoz G, 2009, PROC CVPR IEEE, P549, DOI 10.1109/CVPRW.2009.5206574

← 1 2 3 4 5 6 →