A New Approach for Super Resolution Object Detection Using an Image Slicing Algorithm and the Segment Anything Model

被引:1
作者
Telceken, Muhammed [1 ,2 ]
Akgun, Devrim [3 ]
Kacar, Sezgin [4 ]
Bingol, Bunyamin [4 ]
机构
[1] Sakarya Univ, Inst Nat Sci, Comp Engn, TR-54050 Sakarya, Turkiye
[2] Sakarya Univ Appl Sci, Comp Engn Dept, TR-54050 Sakarya, Turkiye
[3] Sakarya Univ, Software Engn Dept, TR-54050 Sakarya, Turkiye
[4] Sakarya Univ Appl Sci, Elect & Elect Engn Dept, TR-54050 Sakarya, Turkiye
关键词
object detection; super resolution; YOLO; SAM; SRGAN; xView; VisDrone;
D O I
10.3390/s24144526
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Object detection in high resolution enables the identification and localization of objects for monitoring critical areas with precision. Although there have been improvements in object detection at high resolution, the variety of object scales, as well as the diversity of backgrounds and textures in high-resolution images, make it challenging for detectors to generalize successfully. This study introduces a new method for object detection in high-resolution images. The pre-processing stage of the method includes ISA and SAM to slice the input image and segment the objects in bounding boxes, respectively. In order to improve the resolution in the slices, the first layer of YOLO is designed as SRGAN. Thus, before applying YOLO detection, the resolution of the sliced images is increased to improve features. The proposed system is evaluated on xView and VisDrone datasets for object detection algorithms in satellite and aerial imagery contexts. The success of the algorithm is presented in four different YOLO architectures integrated with SRGAN. According to comparative evaluations, the proposed system with Yolov5 and Yolov8 produces the best results on xView and VisDrone datasets, respectively. Based on the comparisons with the literature, our proposed system produces better results.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Multi-zize Object Detection Model in Aerial Images Based on Super Resolution Model Structure
    Kim, HaeMoon
    Ahn, JongSik
    Lee, Tae-Young
    Choi, Byungin
    JOURNAL OF THE KOREAN SOCIETY FOR AERONAUTICAL AND SPACE SCIENCES, 2023, 51 (08) : 553 - 562
  • [32] Super-resolution algorithm of lunar panchromatic image based on random degradation model
    Lan, Lin
    Lu, Chunling
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XXIX, 2023, 12733
  • [33] Super resolution reconstruction of μ-CT image of rock sample using neighbour embedding algorithm
    Wang, Yuzhu
    Rahman, Sheik S.
    Arns, Christoph H.
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 493 : 177 - 188
  • [34] Single image super resolution using neighbor embedding and statistical prediction model
    Rahiman, V. Abdu
    George, Sudhish N.
    COMPUTERS & ELECTRICAL ENGINEERING, 2017, 62 : 281 - 292
  • [35] Super-resolution reconstruction of ultrasound image using a modified diffusion model
    Liu, Tianyu
    Han, Shuai
    Xie, Linru
    Xing, Wenyu
    Liu, Chengcheng
    Li, Boyi
    Ta, Dean
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (12)
  • [36] Single Space Object Image Denoising and Super-Resolution Reconstructing Using Deep Convolutional Networks
    Feng, Xubin
    Su, Xiuqin
    Shen, Junge
    Jin, Humin
    REMOTE SENSING, 2019, 11 (16)
  • [37] A non-local approach for image super-resolution using intermodality priors
    Rousseau, Francois
    MEDICAL IMAGE ANALYSIS, 2010, 14 (04) : 594 - 605
  • [38] A new denoising model for multi-frame super-resolution image reconstruction
    El Mourabit, Idriss
    El Rhabi, Mohammed
    Hakim, Abdelilah
    Laghrib, Amine
    Moreau, Eric
    SIGNAL PROCESSING, 2017, 132 : 51 - 65
  • [39] A New Single Image Super-Resolution Method Based on the Infinite Mixture Model
    Cheng, Peitao
    Qiu, Yuanying
    Wang, Xiumei
    Zhao, Ke
    IEEE ACCESS, 2017, 5 : 2228 - 2240
  • [40] A Super-Resolution Algorithm Using Linear Regression Based on Image Self-Similarity
    Tai, Shen-Chuan
    Huang, Jiun-Jie
    Chen, Peng-Yu
    2016 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C), 2016, : 275 - 278