Giant Panda Identification

被引：22

作者：

Wang, Le ^{[1
]}

Ding, Rizhi ^{[1
]}

Zhai, Yuanhao ^{[1
]}

Zhang, Qilin ^{[2
]}

Tang, Wei ^{[3
]}

Zheng, Nanning ^{[1
]}

Hua, Gang ^{[4
]}

机构：

[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Peoples R China

[2] ABB Corp Res Ctr, Raleigh, NC 27606 USA

[3] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA

[4] Wormpex AI Res, Bellevue, WA 98004 USA

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2021年 / 30卷

基金：

国家重点研发计划;

关键词：

Task analysis; Detectors; Feature extraction; Annotations; Visualization; Face recognition; Convolution; Giant panda identification; feature fusion; patch detector; fine-grained recognition;

D O I：

10.1109/TIP.2021.3055627

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The lack of automatic tools to identify giant panda makes it hard to keep track of and manage giant pandas in wildlife conservation missions. In this paper, we introduce a new Giant Panda Identification (GPID) task, which aims to identify each individual panda based on an image. Though related to the human re-identification and animal classification problem, GPID is extraordinarily challenging due to subtle visual differences between pandas and cluttered global information. In this paper, we propose a new benchmark dataset iPanda-50 for GPID. The iPanda-50 consists of 6, 874 images from 50 giant panda individuals, and is collected from panda streaming videos. We also introduce a new Feature-Fusion Network with Patch Detector (FFN-PD) for GPID. The proposed FFN-PD exploits the patch detector to detect discriminative local patches without using any part annotations or extra location sub-networks, and builds a hierarchical representation by fusing both global and local features to enhance the inter-layer patch feature interactions. Specifically, an attentional cross-channel pooling is embedded in the proposed FFN-PD to improve the identify-specific patch detectors. Experiments performed on the iPanda-50 datasets demonstrate the proposed FFN-PD significantly outperforms competing methods. Besides, experiments on other fine-grained recognition datasets (i.e., CUB-200-2011, Stanford Cars, and FGVC-Aircraft) demonstrate that the proposed FFN-PD outperforms existing state-of-the-art methods.

引用

页码：2837 / 2849

页数：13

共 58 条

[1]

Branson S., 2014, BMVC, P7

[2] Higher-order Integration of Hierarchical Convolutional Activations for Fine-grained Visual Categorization [J].

Cai, Sijia ;

Zuo, Wangmeng ;

Zhang, Lei .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :511-520

[3] Destruction and Construction Learning for Fine-grained Image Recognition [J].

Chen, Yue ;

Bai, Yalong ;

Zhang, Wei ;

Mei, Tao .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5152-5161

[4] PaMM: Pose-Aware Multi-Shot Matching for Improving Person Re-Identification [J].

Cho, Yeong-Jun ;

Yoon, Kuk-Jin .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (08) :3739-3752

[5] Kernel Pooling for Convolutional Neural Networks [J].

Cui, Yin ;

Zhou, Feng ;

Wang, Jiang ;

Liu, Xiao ;

Lin, Yuanqing ;

Belongie, Serge .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3049-3058

[6]

Ding RZ, 2020, INT CONF ACOUST SPEE, P2108, DOI [10.1109/icassp40776.2020.9052960, 10.1109/ICASSP40776.2020.9052960]

[7] Selective Sparse Sampling for Fine-grained Image Recognition [J].

Ding, Yao ;

Zhou, Yanzhao ;

Zhu, Yi ;

Ye, Qixiang ;

Jiao, Jianbin .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6598-6607

[8]

Farrell R, 2011, IEEE I CONF COMP VIS, P161, DOI 10.1109/ICCV.2011.6126238

[9] Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-grained Image Recognition [J].

Fu, Jianlong ;

Zheng, Heliang ;

Mei, Tao .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4476-4484

[10] Compact Bilinear Pooling [J].

Gao, Yang ;

Beijbom, Oscar ;

Zhang, Ning ;

Darrell, Trevor .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :317-326

← 1 2 3 4 5 6 →