Efficient Point Process Inference for Large-scale Object Detection

被引:12
作者
Pham, Trung T. [1 ]
Rezatofighi, Seyed Hamid [1 ]
Reid, Ian [1 ]
Chin, Tat-Jun [1 ]
机构
[1] Univ Adelaide, Sch Comp Sci, Adelaide, SA, Australia
来源
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016年
基金
澳大利亚研究理事会;
关键词
EXTRACTION; MODELS;
D O I
10.1109/CVPR.2016.310
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We tackle the problem of large-scale object detection in images, where the number of objects can be arbitrarily large, and can exhibit significant overlap/occlusion. A successful approach to modelling the large-scale nature of this problem has been via point process density functions which jointly encode object qualities and spatial interactions. But the corresponding optimisation problem is typically difficult or intractable, and many of the best current methods rely on Monte Carlo Markov Chain (MCMC) simulation, which converges slowly in a large solution space. We propose an efficient point process inference for large-scale object detection using discrete energy minimization. In particular, we approximate the solution space by a finite set of object proposals and cast the point process density function to a corresponding energy function of binary variables whose values indicate which object proposals are accepted. We resort to the local submodular approximation (LSA) based trust-region optimisation to find the optimal solution. Furthermore we analyse the error of LSA approximation, and show how to adjust the point process energy to dramatically speed up the convergence without harming the optimality. We demonstrate the superior efficiency and accuracy of our method using a variety of large-scale object detection applications such as crowd human detection, birds, cells counting/localization.
引用
收藏
页码:2837 / 2845
页数:9
相关论文
共 29 条
  • [1] [Anonymous], 2000, ICIAM
  • [2] Arteta C, 2014, LECT NOTES COMPUT SC, V8691, P504, DOI 10.1007/978-3-319-10578-9_33
  • [3] Baddeley A., 1993, STAT IMAGES, V1, P231, DOI DOI 10.1080/02664769300000065
  • [4] Barinova Olga., 2010, CVPR
  • [5] Privacy preserving crowd monitoring: Counting people without people models or tracking
    Chan, Antoni B.
    Liang, Zhang-Sheng John
    Vasconcelos, Nuno
    [J]. 2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 1766 - 1772
  • [6] Discriminative Models for Multi-Class Object Layout
    Desai, Chaitanya
    Ramanan, Deva
    Fowlkes, Charless C.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2011, 95 (01) : 1 - 12
  • [7] Descamps S., 2008, ICASSP
  • [8] Marked point process in image analysis
    Descombes, X
    Zerubia, J
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2002, 19 (05) : 77 - 84
  • [9] Object Extraction Using a Stochastic Birth-and-Death Dynamics in Continuum
    Descombes, Xavier
    Minlos, Robert
    Zhizhina, Elena
    [J]. JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2009, 33 (03) : 347 - 359
  • [10] Pedestrian Detection: An Evaluation of the State of the Art
    Dollar, Piotr
    Wojek, Christian
    Schiele, Bernt
    Perona, Pietro
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) : 743 - 761