Fine-grained landuse characterization using ground-based pictures: a deep learning solution based on globally available data

被引:68
作者
Srivastava, Shivangi [1 ]
Vargas Munoz, John E. [2 ]
Lobry, Sylvain [1 ]
Tuia, Devis [1 ]
机构
[1] Wageningen Univ & Res, Lab Geoinformat Sci & Remote Sensing, Wageningen, Netherlands
[2] Univ Estadual Campinas, Inst Comp, Campinas, Brazil
基金
巴西圣保罗研究基金会;
关键词
Landuse characterization; convolutional neural networks; ground-based pictures; volunteered geographic information; urban areas; NETWORK; COVER;
D O I
10.1080/13658816.2018.1542698
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study the problem of landuse characterization at the urban-object level using deep learning algorithms. Traditionally, this task is performed by surveys or manual photo interpretation, which are expensive and difficult to update regularly. We seek to characterize usages at the single object level and to differentiate classes such as educational institutes, hospitals and religious places by visual cues contained in side-view pictures from Google Street View (GSV). These pictures provide geo-referenced information not only about the material composition of the objects but also about their actual usage, which otherwise is difficult to capture using other classical sources of data such as aerial imagery. Since the GSV database is regularly updated, this allows to consequently update the landuse maps, at lower costs than those of authoritative surveys. Because every urban-object is imaged from a number of viewpoints with street-level pictures, we propose a deep-learning based architecture that accepts arbitrary number of GSV pictures to predict the fine-grained landuse classes at the object level. These classes are taken from OpenStreetMap. A quantitative evaluation of the area of ile-de-France, France shows that our model outperforms other deep learning-based methods, making it a suitable alternative to manual landuse characterization.
引用
收藏
页码:1117 / 1136
页数:20
相关论文
共 26 条
[1]  
[Anonymous], 2017, P IEEE INT C COMP VI
[2]  
Bromley J., 1993, International Journal of Pattern Recognition and Artificial Intelligence, V7, P669, DOI 10.1142/S0218001493000339
[3]  
Chen Y., 2017, P IEEE INT C COMP VI
[4]   What Makes Paris Look like Paris? [J].
Doersch, Carl ;
Singh, Saurabh ;
Gupta, Abhinav ;
Sivic, Josef ;
Efros, Alexei A. .
ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04)
[5]  
Gebru T., 2017, P NATL ACAD SCI
[6]  
Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
[7]   Completion of the 2011 National Land Cover Database for the Conterminous United States - Representing a Decade of Land Cover Change Information [J].
Homer, Collin ;
Dewitz, Jon ;
Yang, Limin ;
Jin, Suming ;
Danielson, Patrick ;
Xian, George ;
Coulston, John ;
Herold, Nathaniel ;
Wickham, James ;
Megown, Kevin .
PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2015, 81 (05) :345-354
[8]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[9]   TI-POOLING: transformation-invariant pooling for feature learning in Convolutional Neural Networks [J].
Laptev, Dmitry ;
Savinov, Nikolay ;
Buhmann, Joachim M. ;
Pollefeys, Marc .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :289-297
[10]   Toward Seamless Multiview Scene Analysis From Satellite to Street Level [J].
Lefevre, Sebastien ;
Tuia, Devis ;
Wegner, Jan Dirk ;
Produit, Timothee ;
Nassar, Ahmed Samy .
PROCEEDINGS OF THE IEEE, 2017, 105 (10) :1884-1899