Heading Direction Estimation Using Deep Learning with Automatic Large-scale Data Acquisition

Cited by: 0

Authors:

Berriel, Rodrigo F. [1]

Torres, Lucas Tabelini [1]

Cardoso, Vinicius B. [1]

Guidolini, Ranik [1]

Badue, Claudine [1]

De Souza, Alberto F. [1]

Oliveira-Santos, Thiago [1]

Affiliation:

[1] Univ Fed Espirito Santo, Dept Informat, Vitoria, ES, Brazil

Source:

2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2018

Keywords:

Deep Learning; Heading Estimation; Convolutional Neural Networks

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Advanced Driver Assistance Systems (ADAS) have experienced major advances in the past few years. The main objectives of ADAS include keeping the vehicle in the correct road direction and avoiding collisions with other vehicles or surrounding obstacles. In this paper, we address the problem of estimating the heading direction that keeps the vehicle aligned with the road direction. This information can be used in precise localization, road and lane keeping, and lane departure warning, among other applications. To enable this approach, a large-scale database (more than 1 million images) was automatically acquired and annotated using publicly available platforms such as the Google Street View API and OpenStreetMap. After the acquisition of the database, a CNN model was trained to predict how much the heading direction of a car should change in order to align it with the road 4 meters ahead. To assess the performance of the model, experiments were performed using images from two different sources: a hidden test set of Google Street View (GSV) images and two datasets from our autonomous car (IARA). The model achieved a low mean average error of 2.359 degrees and 2.524 degrees on the GSV and IARA datasets, respectively, performing consistently across the different datasets. It is worth noting that the images from the IARA dataset are very different (camera, FOV, brightness, etc.) from those of the GSV dataset, which shows the robustness of the model. In conclusion, the model was trained effortlessly (using automatic processes) and showed promising results on real-world databases while working in real time (more than 75 frames per second).
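The abstract does not spell out how the training target or the reported error is computed. As a rough illustration only, the sketch below shows one plausible way to derive a signed heading offset from map geometry and to average angular errors. All function names, the use of a great-circle bearing between two road points, and the wrap-around convention are assumptions made for this example, not the authors' published procedure.

```python
import math


def bearing_deg(lat1, lon1, lat2, lon2):
    """Initial great-circle bearing from point 1 to point 2, in degrees [0, 360).

    Hypothetical helper: the two points stand in for consecutive road
    coordinates (for example, taken from map data).
    """
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dlon = math.radians(lon2 - lon1)
    y = math.sin(dlon) * math.cos(phi2)
    x = (math.cos(phi1) * math.sin(phi2)
         - math.sin(phi1) * math.cos(phi2) * math.cos(dlon))
    return math.degrees(math.atan2(y, x)) % 360.0


def heading_offset_deg(camera_heading, road_bearing):
    """Signed heading change (degrees) that would align the camera with the road.

    The result is wrapped to [-180, 180), so 350 -> 10 is +20 rather than -340.
    """
    return (road_bearing - camera_heading + 180.0) % 360.0 - 180.0


def mean_abs_error_deg(predicted, target):
    """Mean absolute angular difference between two sequences of angles in degrees."""
    errors = [abs(heading_offset_deg(p, t)) for p, t in zip(predicted, target)]
    return sum(errors) / len(errors)


if __name__ == "__main__":
    # Road segment pointing due east; camera currently facing 80 degrees.
    road = bearing_deg(0.0, 0.0, 0.0, 1.0)
    print(heading_offset_deg(80.0, road))
```

Wrapping the difference keeps the target continuous across the 0/360 boundary, which matters when a regression model is trained on angles.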

Pages: 8
