Recognition of Objects in the Urban Environment using R-CNN and YOLO Deep Learning Algorithms

被引：0

作者：

Saric, Rijad ^{[1
]}

Ulbricht, Markus ^{[2
]}

Krstic, Milos ^{[2
,3
]}

Kevric, Jasmin ^{[1
]}

Jokic, Dejan ^{[1
]}

机构：

[1] Int Burch Univ IBU, Dept Elect & Elect Engn, Sarajevo, Bosnia & Herceg

[2] IHP Leibniz Inst Innovat Mikroelekt, Frankfurt, Germany

[3] Univ Potsdam, Potsdam, Germany

来源：

2020 9TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO) | 2020年

关键词：

computer vision; deep learning; R-CNN; YOLO; automated driving; neural networks; TensorFlow;

D O I：

10.1109/meco49872.2020.9134080

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Over the course of the last decade, the subfield of artificial intelligence, called deep learning, becomes the main technology that provides breakthroughs in the computer vision area. Likewise, deep learning algorithms made a major impact in the automated driving domain. This research aims to apply and evaluate the performance of two pre-trained deep learning algorithms in order to recognize different street objects. Both RCNN, as well as YOLO algorithms, are used to recognize bikes, cars and pedestrians using the public GRAZ-02 dataset composed of 1476 raw images of street objects. Accuracy greater than 90% is achieved in recognizing all considered objects. The fine-tuning and training of both algorithms is established using databases named ImageNet and COCO, and afterwards, trained models are tried on the test data.

引用

页码：447 / 450

页数：4

共 21 条

[1] Arvin AM, 2009, LIVE VARIOLA VIRUS: CONSIDERATIONS FOR CONTINUING RESEARCH, P9
[2] Babatunde O., 2015, Journal of Agricultural Informatics, V6, P61
[3] RECOGNITION-BY-COMPONENTS - A THEORY OF HUMAN IMAGE UNDERSTANDING
BIEDERMAN, I
[J]. PSYCHOLOGICAL REVIEW, 1987, 94 (02) : 115 - 147
[4] Bo Zhang, 2010, 2010 9th IEEE International Conference on Cognitive Informatics (ICCI), DOI 10.1109/COGINF.2010.5599750
[5] Calkins H, 2017, J ARRYTHM, V33, P369, DOI 10.1016/j.joa.2017.08.001
[6] Hybridization of convergent photogrammetry, computer vision, and artificial intelligence for digital documentation of cultural heritage. A case study: The Magdalena Palace
Cosido, Oscar
Iglesias, Andres
Galvez, Akemi
Catuogno, Raffaele
Campi, Massimiliano
Teran, Leticia
Sainz, Esteban
[J]. 2014 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2014, : 369 - 376
[7] Cooperative Adaptive Cruise Control: A Reinforcement Learning Approach
Desjardins, Charles
Chaib-draa, Brahim
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 12 (04) : 1248 - 1260
[8] Rich feature hierarchies for accurate object detection and semantic segmentation
Girshick, Ross
Donahue, Jeff
Darrell, Trevor
Malik, Jitendra
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
[9] Karami E., 2017, IMAGE IDENTIFICATION
[10] A Composite Model of Wound Segmentation Based on Traditional Methods and Deep Neural Networks
Li, Fangzhao
Wang, Changjian
Liu, Xiaohui
Peng, Yuxing
Jin, Shiyao
[J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018

← 1 2 3 →