TorontoCity: Seeing the World with a Million Eyes

被引:63
作者
Wang, Shenlong [1 ,2 ]
Bai, Min [1 ,2 ]
Mattyus, Gellert [1 ,2 ]
Chu, Hang [1 ]
Luo, Wenjie [1 ,2 ]
Yang, Bin [1 ,2 ]
Liang, Justin [1 ,2 ]
Cheverie, Joel [1 ]
Fidler, Sanja [1 ]
Urtasun, Raquel [1 ,2 ]
机构
[1] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
[2] Uber Adv Technol Grp, Pittsburgh, PA 15201 USA
来源
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1109/ICCV.2017.327
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we introduce the TorontoCity benchmark, which covers the full greater Toronto area (GTA) with 712.5km(2) of land, 8439km of road and around 400, 000 buildings. Our benchmark provides different perspectives of the world captured from airplanes, drones and cars driving around the city. Manually labeling such a large scale dataset is infeasible. Instead, we propose to utilize different sources of high-precision maps to create our ground truth. Towards this goal, we develop algorithms that allow us to align all data sources with the maps while requiring minimal human supervision. We have designed a wide variety of tasks including building height estimation (reconstruction), road centerline and curb extraction, building instance segmentation, building contour extraction (reorganization), semantic labeling and scene type classification (recognition). Our pilot study shows that most of these tasks are still difficult for modern convolutional neural networks.
引用
收藏
页码:3028 / 3036
页数:9
相关论文
共 36 条
[1]  
[Anonymous], 2016, CVPR
[2]  
[Anonymous], 2015, SCENENET UNDERSTANDI
[3]  
[Anonymous], 1972, COMPUTER GRAPHICS IM
[4]  
[Anonymous], ECCV
[5]  
[Anonymous], P INDIANA ACAD SCI
[6]  
[Anonymous], 2016, CVPR
[7]  
[Anonymous], 2014, VERY DEEP CONVOLUTIO
[8]  
[Anonymous], IEEE COMPUTER
[9]  
[Anonymous], 2016, CVPR
[10]  
[Anonymous], ISPRS J PHOTOGRAMMET