Real-Time Object Detection using an Ultra-High-Resolution Camera on Embedded Systems

被引:2
作者
Antonakakis, Marios [1 ]
Tzavaras, Aimilios [1 ]
Tsakos, Konstantinos [1 ]
Spanakis, Emmanouil G. [2 ]
Sakkalis, Vangelis [2 ]
Zervakis, Michalis [1 ]
Petrakis, Euripides G. M. [1 ]
机构
[1] Tech Univ Crete, Sch Elect & Comp Engn, Khania, Crete, Greece
[2] Fdn Res & Technol FORTH, Inst Comp Sci, Iraklion, Crete, Greece
来源
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGING SYSTEMS AND TECHNIQUES (IST 2022) | 2022年
基金
欧盟地平线“2020”;
关键词
ultra-high-resolution images; real-time object detection; YOLOv5; embedded systems; remote sensing;
D O I
10.1109/IST55454.2022.9827742
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unnamed Aerial Vehicle (UAV) - based remote sensing is a promising technology that is being applied for inspecting live scenes from high altitudes (e.g., for surveillance and recognizing emergencies)..he evolution of hardware and software technologies in the last few years has generated additional interest in embedded systems research and its implementation in energy-independent UAVs for remote sensing. Alongside, ultra-high-resolution optical sensors are mandatory for acquiring high-resolution images which are necessary for accurate object detection from a distance (e.g., 1,000 meters). The processing of ultra-high-resolution images (e.g., 4K or 8K) is beyond the typical resolutions which are used for object detection (e.g., < 2K) emerging a necessity for special treatment in order to succeed a fast object detection. We propose a three-step approach deployed on a Docker runtime environment in an Nvidia Jetson AGX Xavier board. To support fast object detection, the captured images are split into K parts processed in parallel in separate containers running the YOLOv5 object detection algorithm. A final detection is constructed based on each one of the K detections. The experimental results are a good support to our claims of efficiency: the method can achieve close to real-time object detection for ultra-high (i.e., 8K) resolution images (i.e., in less than 1 second per frame).
引用
收藏
页数:6
相关论文
共 16 条
[1]  
Bradski G, 2000, DR DOBBS J, V25, P120
[2]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[3]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[4]  
glenn-jocher, 2021, YOLOv5 Focus() Layer #3181, Ultralytics: Github
[5]  
Grinberg M., 2018, Flask Web Development: Developing Web Applications with Python
[6]  
He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/ICCV.2017.322, 10.1109/TPAMI.2018.2844175]
[7]  
Iandola F, 2014, Arxiv, DOI arXiv:1404.1869
[8]  
Johnson J., 1958, IMAGE INTENSIFIER S, P244
[9]  
Li Z., 2022, J PHYS C SER, V2171
[10]   Feature Pyramid Networks for Object Detection [J].
Lin, Tsung-Yi ;
Dollar, Piotr ;
Girshick, Ross ;
He, Kaiming ;
Hariharan, Bharath ;
Belongie, Serge .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :936-944