"Looking at the right stuff" - Guided semantic-gaze for autonomous driving

Cited by: 25

Authors:

Pal, Anwesan [1]

Mondal, Sayan [1]

Christensen, Henrik I. [1]

Affiliations:

[1] Univ Calif San Diego, La Jolla, CA 92093 USA

Source:

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020) | 2020

Keywords:

SALIENT; ATTENTION; MODEL;

DOI:

10.1109/CVPR42600.2020.01190

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In recent years, predicting the driver's focus of attention has been a very active area of research in the autonomous driving community. Unfortunately, existing state-of-the-art techniques achieve this by relying only on human gaze information, thereby ignoring scene semantics. We propose a novel Semantics Augmented GazE (SAGE) detection approach that captures driving-specific contextual information in addition to the raw gaze. Such a combined attention mechanism serves as a powerful tool for focusing on the relevant regions in an image frame in order to make driving both safe and efficient. Using this, we design a complete saliency prediction framework, SAGE-Net, which modifies the initial prediction from SAGE by taking into account vital aspects such as distance to objects (depth), ego-vehicle speed, and pedestrian crossing intent. Exhaustive experiments conducted with four popular saliency algorithms show that in 49/56 (87.5%) of cases, considering both the overall dataset and crucial driving scenarios, SAGE outperforms existing techniques without any additional computational overhead during the training process. The augmented dataset, along with the relevant code, is available as part of the supplementary material.


Pages: 11880-11889

Page count: 10
