Recognition of Objects in the Urban Environment using R-CNN and YOLO Deep Learning Algorithms

被引:0
作者
Saric, Rijad [1 ]
Ulbricht, Markus [2 ]
Krstic, Milos [2 ,3 ]
Kevric, Jasmin [1 ]
Jokic, Dejan [1 ]
机构
[1] Int Burch Univ IBU, Dept Elect & Elect Engn, Sarajevo, Bosnia & Herceg
[2] IHP Leibniz Inst Innovat Mikroelekt, Frankfurt, Germany
[3] Univ Potsdam, Potsdam, Germany
来源
2020 9TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO) | 2020年
关键词
computer vision; deep learning; R-CNN; YOLO; automated driving; neural networks; TensorFlow;
D O I
10.1109/meco49872.2020.9134080
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Over the course of the last decade, the subfield of artificial intelligence, called deep learning, becomes the main technology that provides breakthroughs in the computer vision area. Likewise, deep learning algorithms made a major impact in the automated driving domain. This research aims to apply and evaluate the performance of two pre-trained deep learning algorithms in order to recognize different street objects. Both RCNN, as well as YOLO algorithms, are used to recognize bikes, cars and pedestrians using the public GRAZ-02 dataset composed of 1476 raw images of street objects. Accuracy greater than 90% is achieved in recognizing all considered objects. The fine-tuning and training of both algorithms is established using databases named ImageNet and COCO, and afterwards, trained models are tried on the test data.
引用
收藏
页码:447 / 450
页数:4
相关论文
共 21 条
  • [1] Arvin AM, 2009, LIVE VARIOLA VIRUS: CONSIDERATIONS FOR CONTINUING RESEARCH, P9
  • [2] Babatunde O., 2015, Journal of Agricultural Informatics, V6, P61
  • [3] RECOGNITION-BY-COMPONENTS - A THEORY OF HUMAN IMAGE UNDERSTANDING
    BIEDERMAN, I
    [J]. PSYCHOLOGICAL REVIEW, 1987, 94 (02) : 115 - 147
  • [4] Bo Zhang, 2010, 2010 9th IEEE International Conference on Cognitive Informatics (ICCI), DOI 10.1109/COGINF.2010.5599750
  • [5] Calkins H, 2017, J ARRYTHM, V33, P369, DOI 10.1016/j.joa.2017.08.001
  • [6] Hybridization of convergent photogrammetry, computer vision, and artificial intelligence for digital documentation of cultural heritage. A case study: The Magdalena Palace
    Cosido, Oscar
    Iglesias, Andres
    Galvez, Akemi
    Catuogno, Raffaele
    Campi, Massimiliano
    Teran, Leticia
    Sainz, Esteban
    [J]. 2014 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2014, : 369 - 376
  • [7] Cooperative Adaptive Cruise Control: A Reinforcement Learning Approach
    Desjardins, Charles
    Chaib-draa, Brahim
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 12 (04) : 1248 - 1260
  • [8] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
  • [9] Karami E., 2017, IMAGE IDENTIFICATION
  • [10] A Composite Model of Wound Segmentation Based on Traditional Methods and Deep Neural Networks
    Li, Fangzhao
    Wang, Changjian
    Liu, Xiaohui
    Peng, Yuxing
    Jin, Shiyao
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018