Towards Automatic Wild Animal Detection in Low Quality Camera-trap Images Using Two-channeled Perceiving Residual Pyramid Networks

被引:20
作者
Zhu, Chunbiao [1 ]
Li, Thomas H. [2 ]
Li, Ge [1 ]
机构
[1] Peking Univ, Shenzhen Grad Sch, SECE, Shenzhen, Peoples R China
[2] Gpower Semicond Inc, Suzhou, Peoples R China
来源
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017) | 2017年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCVW.2017.337
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monitoring animals in the wild without disturbing them is possible using camera trapping framework, which is a technique to study wildlife using automatically triggered cameras and produces great volumes of data. However, camera trapping collects images often result in low image quality and includes a lot of false positives (images without animals), which must be detection before the post-processing step. This paper presents a two-channeled perceiving residual pyramid networks (TPRPN) for cameratrap images objection. Our TPRPN model attends to generating high-resolution and high-quality results. In order to provide enough local information, we extract depth cue from the original images and use two-channeled perceiving model as input to training our networks. Finally, the proposed three-layer residual blocks learn to merge all the information and generate full size detection results. Besides, we construct a new high-quality dataset with the help of Wildlife Thailand's Community and eMammal Organization. Experimental results on our dataset demonstrate that our method is superior to the existing object detection methods.
引用
收藏
页码:2860 / 2864
页数:5
相关论文
共 18 条
[11]  
Simonyan K, 2015, Arxiv, DOI arXiv:1409.1556
[12]   Secrets of Optical Flow Estimation and Their Principles [J].
Sun, Deqing ;
Roth, Stefan ;
Black, Michael J. .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :2432-2439
[13]   Snapshot Serengeti, high-frequency annotated camera trap images of 40 mammalian species in an African savanna [J].
Swanson, Alexandra ;
Kosmala, Margaret ;
Lintott, Chris ;
Simpson, Robert ;
Smith, Arfon ;
Packer, Craig .
SCIENTIFIC DATA, 2015, 2
[14]   Automated identification of animal species in camera trap images [J].
Yu, Xiaoyuan ;
Wang, Jiangping ;
Kays, Roland ;
Jansen, Patrick A. ;
Wang, Tianjiang ;
Huang, Thomas .
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2013,
[15]  
Zhu C., 2017, 2017 IEEE INT C COMP
[16]  
Zhu C., 2017, ACM MULTIMEDIA WORKS
[17]  
Zhu C., 2017, MULTILAYER BACKPROPA, P14
[18]   Salient Object Detection with Complex Scene based on Cognitive Neuroscience [J].
Zhu, Chunbiao ;
Li, Ge ;
Wang, Wenmin ;
Wang, Ronggang .
2017 IEEE THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2017), 2017, :33-37