Center and Scale Prediction: Anchor-free Approach for Pedestrian and Face Detection

被引:24
|
作者
Liu, Wei [1 ]
Hasan, Irtiza [2 ]
Liao, Shengcai [2 ]
机构
[1] Natl Univ Def Technol, Changsha, Peoples R China
[2] Incept Inst Artificial Intelligence IIAI, Abu Dhabi, U Arab Emirates
关键词
Object Detection; Convolutional Neural Networks; Feature Detection; anchor-free; FEATURES;
D O I
10.1016/j.patcog.2022.109071
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection traditionally requires sliding-window classifier in modern deep learning based ap-proaches. However, both of these approaches requires tedious configurations in bounding boxes. Gen-erally speaking, single-class object detection is to tell where the object is, and how big it is. Traditional methods combine the "where" and "how" subproblems into a single one through the overall judgement of various scales of bounding boxes. In view of this, we are interesting in whether the "where" and "how" subproblems can be separated into two independent subtasks to ease the problem definition and the dif-ficulty of training. Accordingly, we provide a new perspective where detecting objects is approached as a high-level semantic feature detection task. Like edges, corners, blobs and other feature detectors, the pro-posed detector scans for feature points all over the image, for which the convolution is naturally suited. However, unlike these traditional low-level features, the proposed detector goes for a higher-level ab-straction, that is, we are looking for central points where there are objects, and modern deep models are already capable of such a high-level semantic abstraction. Like blob detection, we also predict the scales of the central points, which is also a straightforward convolution. Therefore, in this paper, pedestrian and face detection is simplified as a straightforward center and scale prediction task through convolu-tions. This way, the proposed method enjoys an anchor-free setting, considerably reducing the difficulty in training configuration and hyper-parameter optimization. Though structurally simple, it presents com-petitive accuracy on several challenging benchmarks, including pedestrian detection and face detection. Furthermore, a cross-dataset evaluation is performed, demonstrating a superior generalization ability of the proposed method.(c) 2022 Published by Elsevier Ltd.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Orientation-Aware Vehicle Detection in Aerial Images via an Anchor-Free Object Detection Approach
    Shi, Furong
    Zhang, Tong
    Zhang, Tao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (06): : 5221 - 5233
  • [22] Coordinate-based anchor-free module for object detection
    Zhiyong Tang
    Jianbing Yang
    Zhongcai Pei
    Xiao Song
    Applied Intelligence, 2021, 51 : 9066 - 9080
  • [23] An improved anchor-free method for traffic scene object detection
    Ding, Tonghe
    Feng, Kaili
    Yan, Yejin
    Wei, Yanjun
    Li, Tianping
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (22) : 34703 - 34724
  • [24] An improved anchor-free method for traffic scene object detection
    Tonghe Ding
    Kaili Feng
    Yejin Yan
    Yanjun Wei
    Tianping Li
    Multimedia Tools and Applications, 2023, 82 : 34703 - 34724
  • [25] Coordinate-based anchor-free module for object detection
    Tang, Zhiyong
    Yang, Jianbing
    Pei, Zhongcai
    Song, Xiao
    APPLIED INTELLIGENCE, 2021, 51 (12) : 9066 - 9080
  • [26] ObjectBox: From Centers to Boxes for Anchor-Free Object Detection
    Zand, Mohsen
    Etemad, Ali
    Greenspan, Michael
    COMPUTER VISION, ECCV 2022, PT X, 2022, 13670 : 390 - 406
  • [27] Attention-based digital filter with anchor-free feature pyramid learning model for pedestrian detection
    Shrivastava A.
    Poonkuntran S.
    Journal of Intelligent and Fuzzy Systems, 2024, 46 (04) : 10287 - 10303
  • [28] LGADet: Light-weight Anchor-free Multispectral Pedestrian Detection with Mixed Local and Global Attention
    Zuo, Xin
    Wang, Zhi
    Liu, Yue
    Shen, Jifeng
    Wang, Haoran
    NEURAL PROCESSING LETTERS, 2023, 55 (03) : 2935 - 2952
  • [29] AF-EMS Detector: Improve the Multi-Scale Detection Performance of the Anchor-Free Detector
    Yan, Jiangqiao
    Zhao, Liangjin
    Diao, Wenhui
    Wang, Hongqi
    Sun, Xian
    REMOTE SENSING, 2021, 13 (02) : 1 - 18
  • [30] LGADet: Light-weight Anchor-free Multispectral Pedestrian Detection with Mixed Local and Global Attention
    Xin Zuo
    Zhi Wang
    Yue Liu
    Jifeng Shen
    Haoran Wang
    Neural Processing Letters, 2023, 55 : 2935 - 2952