Scale-Balanced Real-Time Object Detection With Varying Input-Image Resolution

被引:9
|
作者
Yan, Longbin [1 ]
Qin, Yunxiao [2 ]
Chen, Jie [1 ]
机构
[1] Northwestern Polytech Univ, Sch Marine Sci & Technol, Xian 710072, Peoples R China
[2] Commun Univ China, Neurosci & Intelligent Media Inst, Beijing 100024, Peoples R China
关键词
Feature extraction; Detectors; Head; Image resolution; Task analysis; Semantics; Object detection; Deep convolution neural network (CNN); object detection; multi-scale features fusion;
D O I
10.1109/TCSVT.2022.3198329
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Current object-detection methods for small-scale objects are often marred by poor performance. Using relatively high-resolution input images can be considered a remedy for this issue, but it usually leads to performance degeneration for large-scale objects. We define this problem as the imbalance of detection performance for multi-scale objects when the resolution of input images varies. In addition, the use of high-resolution images results in significant computational resource consumption and inference-speed impairment. In this paper, we propose a friendly varying-resolution object-detection method for multi-scale objects. We analyze in detail the reasons leading to the performance degradation in the detection of large-scale objects with increasing input-image resolution, and propose a novel lightweight bidirectional feature-flow module to enhance the performance of multi-scale object detection in high-resolution images, especially for large-scale objects. The proposed approach can also ease the problems of computational resource consumption and inference-speed impairment caused by high-resolution images. Additionally, a decoupled detection head is designed to further improve performance by separating classification and regression sub-tasks, and an adaptive feature-fusion module is designed to better fuse different feature levels. The proposed scheme alleviates the negative effects of using high-resolution input images and achieves an excellent balance between inference speed and precision. Experiments on the MS COCO dataset show that the scheme achieves 44.6 AP at 42.6 FPS and 47 AP at 26.7 FPS, showing significant advantages over the methods to which it is compared.
引用
收藏
页码:242 / 256
页数:15
相关论文
共 50 条
  • [1] Scale-balanced loss for object detection
    Shuang, Kai
    Lyu, Zhiheng
    Loo, Jonathan
    Zhang, Wentao
    PATTERN RECOGNITION, 2021, 117
  • [2] Split-and-Shuffle Detector for Real-Time Traffic Object Detection in Aerial Image
    Mao, Guotao
    Liang, Hongbin
    Yao, Yiting
    Wang, Lei
    Zhang, Han
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (08): : 13312 - 13326
  • [3] RT-Deblur: real-time image deblurring for object detection
    Hanzhao Wang
    Chunhua Hu
    Weijie Qian
    Qian Wang
    The Visual Computer, 2024, 40 : 2873 - 2887
  • [4] RT-Deblur: real-time image deblurring for object detection
    Wang, Hanzhao
    Hu, Chunhua
    Qian, Weijie
    Wang, Qian
    VISUAL COMPUTER, 2024, 40 (04) : 2873 - 2887
  • [5] Joint Anchor-Feature Refinement for Real-Time Accurate Object Detection in Images and Videos
    Chen, Xingyu
    Yu, Junzhi
    Kong, Shihan
    Wu, Zhengxing
    Wen, Li
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (02) : 594 - 607
  • [6] Frequency Spectrum Features Modeling for Real-Time Tiny Object Detection in Remote Sensing Image
    Luo, Zhaoyi
    Wang, Yupei
    Chen, Liang
    Yang, Wenying
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [7] Energy-Efficient Real-Time UAV Object Detection on Embedded Platforms
    Deng, Jianing
    Shi, Zhiguo
    Zhuo, Cheng
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (10) : 3123 - 3127
  • [8] Real-time small scale object detection algorithm in real sea area
    Feng H.
    Jiang C.
    Ding Y.
    Xu H.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2023, 51 (04): : 24 - 29
  • [9] Real-time object detection on CUDA
    Herout, Adam
    Josth, Radovan
    Juranek, Roman
    Havel, Jiri
    Hradis, Michal
    Zemcik, Pavel
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2011, 6 (03) : 159 - 170
  • [10] Real-Time Object Detection and Tracking
    Naeem, Hammad
    Ahmad, Jawad
    Tayyab, Muhammad
    2013 16TH INTERNATIONAL MULTI TOPIC CONFERENCE (INMIC), 2013, : 148 - 153