Scale-Balanced Real-Time Object Detection With Varying Input-Image Resolution

被引:9
|
作者
Yan, Longbin [1 ]
Qin, Yunxiao [2 ]
Chen, Jie [1 ]
机构
[1] Northwestern Polytech Univ, Sch Marine Sci & Technol, Xian 710072, Peoples R China
[2] Commun Univ China, Neurosci & Intelligent Media Inst, Beijing 100024, Peoples R China
关键词
Feature extraction; Detectors; Head; Image resolution; Task analysis; Semantics; Object detection; Deep convolution neural network (CNN); object detection; multi-scale features fusion;
D O I
10.1109/TCSVT.2022.3198329
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Current object-detection methods for small-scale objects are often marred by poor performance. Using relatively high-resolution input images can be considered a remedy for this issue, but it usually leads to performance degeneration for large-scale objects. We define this problem as the imbalance of detection performance for multi-scale objects when the resolution of input images varies. In addition, the use of high-resolution images results in significant computational resource consumption and inference-speed impairment. In this paper, we propose a friendly varying-resolution object-detection method for multi-scale objects. We analyze in detail the reasons leading to the performance degradation in the detection of large-scale objects with increasing input-image resolution, and propose a novel lightweight bidirectional feature-flow module to enhance the performance of multi-scale object detection in high-resolution images, especially for large-scale objects. The proposed approach can also ease the problems of computational resource consumption and inference-speed impairment caused by high-resolution images. Additionally, a decoupled detection head is designed to further improve performance by separating classification and regression sub-tasks, and an adaptive feature-fusion module is designed to better fuse different feature levels. The proposed scheme alleviates the negative effects of using high-resolution input images and achieves an excellent balance between inference speed and precision. Experiments on the MS COCO dataset show that the scheme achieves 44.6 AP at 42.6 FPS and 47 AP at 26.7 FPS, showing significant advantages over the methods to which it is compared.
引用
收藏
页码:242 / 256
页数:15
相关论文
共 50 条
  • [21] Real-time object detection applied on drones
    Wei, Jingjing
    Zhao, Yiding
    International Agricultural Engineering Journal, 2019, 28 (04): : 450 - 459
  • [22] MonoAMNet: Three-Stage Real-Time Monocular 3D Object Detection With Adaptive Methods
    Pan, Huihui
    Jia, Yisong
    Wang, Jue
    Sun, Weichao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (03) : 3574 - 3587
  • [23] DPNet: Dual-Path Network for Real-Time Object Detection With Lightweight Attention
    Zhou, Quan
    Shi, Huimin
    Xiang, Weikang
    Kang, Bin
    Latecki, Longin Jan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [24] Salient Object Detection by Spatiotemporal and Semantic Features in Real-Time Video Processing Systems
    Fang, Yuming
    Ding, Guanqun
    Wen, Wenying
    Yuan, Feiniu
    Yang, Yong
    Fang, Zhijun
    Lin, Weisi
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2020, 67 (11) : 9893 - 9903
  • [25] Region Boosting for Real-Time Object Detection Using Multi-Dimensional Attention
    Chen, Jinlong
    Xu, Kejian
    Ning, Yi
    Xu, Zhi
    IEEE ACCESS, 2024, 12 : 171634 - 171643
  • [26] Spatial Attention Based Real-Time Object Detection Network for Internet of Things Devices
    Zhang, Yongxin
    Zhao, Peng
    Li, Deguang
    Konstantin, Kostromitin
    IEEE ACCESS, 2020, 8 : 165863 - 165871
  • [27] A Multi-target Edge Service Approach to Real-time Image Object Detection
    Xin, Tinglin
    Li, Shuo
    Zhao, Ting
    Xia, Weishang
    Zhao, Lijiao
    2020 13TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2020), 2020, : 426 - 431
  • [28] Real-time Object Detection for 360-degree Panoramic Image using CNN
    Zhang, Yiming
    Xiao, Xiangyun
    Yang, Xubo
    2017 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV 2017), 2017, : 18 - 23
  • [29] LARNet: Towards Lightweight, Accurate and Real-Time Salient Object Detection
    Wang, Zhenyu
    Zhang, Yunzhou
    Liu, Yan
    Qin, Cao
    Coleman, Sonya A.
    Kerr, Dermot
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5207 - 5222
  • [30] Real-time moving object detection algorithm on high-resolution videos using GPUs
    Praveen Kumar
    Ayush Singhal
    Sanyam Mehta
    Ankush Mittal
    Journal of Real-Time Image Processing, 2016, 11 : 93 - 109