Small Object Detection via Coarse-to-fine Proposal Generation and Imitation Learning

被引:37
|
作者
Yuan, Xiang [1 ]
Cheng, Gong [1 ]
Yan, Kebing [1 ]
Zeng, Qinghua [1 ]
Han, Junwei [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian, Peoples R China
关键词
D O I
10.1109/ICCV51070.2023.00581
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The past few years have witnessed the immense success of object detection, while current excellent detectors struggle on tackling size-limited instances. Concretely, the well-known challenge of low overlaps between the priors and object regions leads to a constrained sample pool for optimization, and the paucity of discriminative information further aggravates the recognition. To alleviate the aforementioned issues, we propose CFINet, a two-stage framework tailored for small object detection based on the Coarseto-fine pipeline and Feature Imitation learning. Firstly, we introduce Coarse-to-fine RPN (CRPN) to ensure sufficient and high-quality proposals for small objects through the dynamic anchor selection strategy and cascade regression. Then, we equip the conventional detection head with a Feature Imitation (FI) branch to facilitate the region representations of size-limited instances that perplex the model in an imitation manner. Moreover, an auxiliary imitation loss following supervised contrastive learning paradigm is devised to optimize this branch. When integrated with Faster RCNN, CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A, underscoring its superiority over baseline detector and other mainstream detection approaches. Code is available at https://github.com/shaunyuan22/CFINet.
引用
收藏
页码:6294 / 6304
页数:11
相关论文
共 50 条
  • [1] Dynamic Coarse-to-Fine Learning for Oriented Tiny Object Detection
    Xu, Chang
    Ding, Jian
    Wang, Jinwang
    Yang, Wen
    Yu, Huai
    Yu, Lei
    Xia, Gui-Song
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7318 - 7328
  • [2] Learning Coarse-to-Fine Sparselets for Efficient Object Detection and Scene Classification
    Cheng, Gong
    Han, Junwei
    Guo, Lei
    Liu, Tianming
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 1173 - 1181
  • [3] Recursive coarse-to-fine localization for fast object detection
    Na, I.S. (ypencil@hanmail.net), 1600, Science and Engineering Research Support Society, 20 Virginia Court, Sandy Bay, Tasmania, Australia (07):
  • [4] A Coarse-to-fine approach for fast deformable object detection
    Pedersoli, Marco
    Vedaldi, Andrea
    Gonzalez, Jordi
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 1353 - 1360
  • [5] A coarse-to-fine approach for fast deformable object detection
    Pedersoli, Marco
    Vedaldi, Andrea
    Gonzalez, Jordi
    Roca, Xavier
    PATTERN RECOGNITION, 2015, 48 (05) : 1844 - 1853
  • [6] Recursive Coarse-to-Fine Localization for Fast Object Detection
    Pedersoli, Marco
    Gonzalez, Jordi
    Bagdanov, Andrew D.
    Villanueva, Juan J.
    COMPUTER VISION - ECCV 2010, PT VI, 2010, 6316 : 280 - +
  • [7] Salient object detection using coarse-to-fine processing
    Zhou, Qiangqiang
    Zhang, Lin
    Zhao, Weidong
    Liu, Xianhui
    Chen, Yufei
    Wang, Zhicheng
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2017, 34 (03) : 370 - 383
  • [8] Coarse-to-Fine Imitation Learning: Robot Manipulation from a Single Demonstration
    Johns, Edward
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4613 - 4619
  • [9] A coarse-to-fine small object detection framework based on a background complexity classification strategy
    Wang R.
    Yang J.
    Xu Y.
    Li H.
    Neural Computing and Applications, 2024, 36 (19) : 11241 - 11255
  • [10] Coarse-to-fine domain adaptation object detection with feature disentanglement
    Li, Jiafeng
    Zhi, Mengxun
    Zheng, Yongyu
    Zhuo, Li
    Zhang, Jing
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025,