Towards Keypoint Guided Self-supervised Depth Estimation

被引:0
作者
Bartol, Kristijan [1 ]
Bojanic, David [1 ]
Petkovic, Tomislav [1 ]
Pribanic, Tomislav [1 ]
Donoso, Yago [2 ]
机构
[1] Univ Zagreb, Fac Elect Engn & Comp, Zagreb, Croatia
[2] Yamagata Univ, Fac Sci, Yamagata, Japan
来源
VISAPP: PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 4: VISAPP | 2020年
关键词
Monocular Depth Estimation; Self-supervised Learning; Keypoint Similarity Loss;
D O I
10.5220/0009190005830589
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper proposes to use keypoints as a self-supervision clue for learning depth map estimation from a collection of input images. As ground truth depth from real images is difficult to obtain, there are many unsupervised and self-supervised approaches to depth estimation that have been proposed. Most of these unsupervised approaches use depth map and ego-motion estimations to reproject the pixels from the current image into the adjacent image from the image collection. Depth and ego-motion estimations are evaluated based on pixel intensity differences between the correspondent original and reprojected pixels. Instead of reprojecting the individual pixels, we propose to first select image keypoints in both images and then reproject and compare the correspondent keypoints of the two images. The keypoints should describe the distinctive image features well. By learning a deep model with and without the keypoint extraction technique, we show that using the keypoints improve the depth estimation learning. We also propose some future directions for keypoint-guided learning of structure-from-motion problems.
引用
收藏
页码:583 / 589
页数:7
相关论文
共 14 条
[1]  
[Anonymous], 2016, CORR
[2]  
[Anonymous], 2016, CORR
[3]  
[Anonymous], 2004, INT J COMPUTER VISIO
[4]  
[Anonymous], 1981, MULTIPLE VIEW GEOMET
[5]  
Bojanic D., 2019, 11 INT S IM SIGN PRO
[6]  
Cordts M., 2016, CORR
[7]  
Furukawa Yasutaka., 2015, Foundations and Trends in Computer Graphics and Vision
[8]   Vision meets robotics: The KITTI dataset [J].
Geiger, A. ;
Lenz, P. ;
Stiller, C. ;
Urtasun, R. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1231-1237
[9]  
Godard C., 2018, INT C COMP VIS ICCV
[10]  
Li W., 2018, BRIT MACH VIS C BMVC