Count-ception: Counting by Fully Convolutional Redundant Counting

被引:85
作者
Cohen, Joseph Paul [1 ,2 ]
Boucher, Genevieve [3 ]
Glastonbury, Craig A. [4 ]
Lo, Henry Z. [5 ]
Bengio, Yoshua [6 ]
机构
[1] Univ Montreal, Montreal Inst Learning Algorithms, Montreal, PQ, Canada
[2] Harvard Univ Herbaria, Cambridge, MA 02138 USA
[3] Univ Montreal, Inst Res Immunol & Canc, Montreal, PQ, Canada
[4] Univ Oxford, Big Data Inst, Oxford, England
[5] Univ Massachusetts, Dept Comp Sci, Boston, MA 02125 USA
[6] Univ Montreal, Montreal Inst Learning Algorithms, CIFAR, Montreal, PQ, Canada
来源
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017) | 2017年
基金
美国国家科学基金会;
关键词
D O I
10.1109/ICCVW.2017.9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Counting objects in digital images is a process that should be replaced by machines. This tedious task is time consuming and prone to errors due to fatigue of human annotators. The goal is to have a system that takes as input an image and returns a count of the objects inside and justification for the prediction in the form of object localization. We repose a problem, originally posed by Lempitsky and, to instead predict a count map which contains redundant counts based on the receptive field of a smaller regression network. The regression network predicts a count of the objects that exist inside this frame. By processing the image in a fully convolutional way each pixel is going to be accounted for some number of times, the number of windows which include it, which is the size of each window, (i.e., 32x32 = 1024). To recover the true count we take the average over the redundant predictions. Our contribution is redundant counting instead of predicting a density map in order to average over errors. We also propose a novel deep neural network architecture adapted from the Inception family of networks called the Count-ception network. Together our approach results in a 20% relative improvement (2.9 to 2.3 MAE) over the state of the art method by Xie, Noble, and Zisserman in 2016.
引用
收藏
页码:18 / 26
页数:9
相关论文
共 26 条
[1]  
[Anonymous], 2016, ARXIV
[2]  
[Anonymous], 2013, P 30 INT C MACH LEAR
[3]  
[Anonymous], C COMP VIS PATT REC
[4]  
[Anonymous], 2015, NATURE
[5]  
[Anonymous], 2016, C COMP VIS PATT REC
[6]  
[Anonymous], J ELECT IMAGING
[7]  
[Anonymous], NATURE GENETICS
[8]  
[Anonymous], 2009, INT C MACH LEARN
[9]  
[Anonymous], 2016, EUR C COMP VIS
[10]  
[Anonymous], 2014, EUR C COMP VIS