Giant Panda Identification

被引:22
作者
Wang, Le [1 ]
Ding, Rizhi [1 ]
Zhai, Yuanhao [1 ]
Zhang, Qilin [2 ]
Tang, Wei [3 ]
Zheng, Nanning [1 ]
Hua, Gang [4 ]
机构
[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Peoples R China
[2] ABB Corp Res Ctr, Raleigh, NC 27606 USA
[3] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
[4] Wormpex AI Res, Bellevue, WA 98004 USA
基金
国家重点研发计划;
关键词
Task analysis; Detectors; Feature extraction; Annotations; Visualization; Face recognition; Convolution; Giant panda identification; feature fusion; patch detector; fine-grained recognition;
D O I
10.1109/TIP.2021.3055627
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The lack of automatic tools to identify giant panda makes it hard to keep track of and manage giant pandas in wildlife conservation missions. In this paper, we introduce a new Giant Panda Identification (GPID) task, which aims to identify each individual panda based on an image. Though related to the human re-identification and animal classification problem, GPID is extraordinarily challenging due to subtle visual differences between pandas and cluttered global information. In this paper, we propose a new benchmark dataset iPanda-50 for GPID. The iPanda-50 consists of 6, 874 images from 50 giant panda individuals, and is collected from panda streaming videos. We also introduce a new Feature-Fusion Network with Patch Detector (FFN-PD) for GPID. The proposed FFN-PD exploits the patch detector to detect discriminative local patches without using any part annotations or extra location sub-networks, and builds a hierarchical representation by fusing both global and local features to enhance the inter-layer patch feature interactions. Specifically, an attentional cross-channel pooling is embedded in the proposed FFN-PD to improve the identify-specific patch detectors. Experiments performed on the iPanda-50 datasets demonstrate the proposed FFN-PD significantly outperforms competing methods. Besides, experiments on other fine-grained recognition datasets (i.e., CUB-200-2011, Stanford Cars, and FGVC-Aircraft) demonstrate that the proposed FFN-PD outperforms existing state-of-the-art methods.
引用
收藏
页码:2837 / 2849
页数:13
相关论文
共 58 条
[1]  
Branson S., 2014, BMVC, P7
[2]   Higher-order Integration of Hierarchical Convolutional Activations for Fine-grained Visual Categorization [J].
Cai, Sijia ;
Zuo, Wangmeng ;
Zhang, Lei .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :511-520
[3]   Destruction and Construction Learning for Fine-grained Image Recognition [J].
Chen, Yue ;
Bai, Yalong ;
Zhang, Wei ;
Mei, Tao .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5152-5161
[4]   PaMM: Pose-Aware Multi-Shot Matching for Improving Person Re-Identification [J].
Cho, Yeong-Jun ;
Yoon, Kuk-Jin .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (08) :3739-3752
[5]   Kernel Pooling for Convolutional Neural Networks [J].
Cui, Yin ;
Zhou, Feng ;
Wang, Jiang ;
Liu, Xiao ;
Lin, Yuanqing ;
Belongie, Serge .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3049-3058
[6]  
Ding RZ, 2020, INT CONF ACOUST SPEE, P2108, DOI [10.1109/icassp40776.2020.9052960, 10.1109/ICASSP40776.2020.9052960]
[7]   Selective Sparse Sampling for Fine-grained Image Recognition [J].
Ding, Yao ;
Zhou, Yanzhao ;
Zhu, Yi ;
Ye, Qixiang ;
Jiao, Jianbin .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6598-6607
[8]  
Farrell R, 2011, IEEE I CONF COMP VIS, P161, DOI 10.1109/ICCV.2011.6126238
[9]   Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-grained Image Recognition [J].
Fu, Jianlong ;
Zheng, Heliang ;
Mei, Tao .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4476-4484
[10]   Compact Bilinear Pooling [J].
Gao, Yang ;
Beijbom, Oscar ;
Zhang, Ning ;
Darrell, Trevor .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :317-326