Fine-grained landuse characterization using ground-based pictures: a deep learning solution based on globally available data

被引:63
作者
Srivastava, Shivangi [1 ]
Vargas Munoz, John E. [2 ]
Lobry, Sylvain [1 ]
Tuia, Devis [1 ]
机构
[1] Wageningen Univ & Res, Lab Geoinformat Sci & Remote Sensing, Wageningen, Netherlands
[2] Univ Estadual Campinas, Inst Comp, Campinas, Brazil
基金
巴西圣保罗研究基金会;
关键词
Landuse characterization; convolutional neural networks; ground-based pictures; volunteered geographic information; urban areas; NETWORK; COVER;
D O I
10.1080/13658816.2018.1542698
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study the problem of landuse characterization at the urban-object level using deep learning algorithms. Traditionally, this task is performed by surveys or manual photo interpretation, which are expensive and difficult to update regularly. We seek to characterize usages at the single object level and to differentiate classes such as educational institutes, hospitals and religious places by visual cues contained in side-view pictures from Google Street View (GSV). These pictures provide geo-referenced information not only about the material composition of the objects but also about their actual usage, which otherwise is difficult to capture using other classical sources of data such as aerial imagery. Since the GSV database is regularly updated, this allows to consequently update the landuse maps, at lower costs than those of authoritative surveys. Because every urban-object is imaged from a number of viewpoints with street-level pictures, we propose a deep-learning based architecture that accepts arbitrary number of GSV pictures to predict the fine-grained landuse classes at the object level. These classes are taken from OpenStreetMap. A quantitative evaluation of the area of ile-de-France, France shows that our model outperforms other deep learning-based methods, making it a suitable alternative to manual landuse characterization.
引用
收藏
页码:1117 / 1136
页数:20
相关论文
共 26 条
  • [1] Bromley J., 1993, International Journal of Pattern Recognition and Artificial Intelligence, V7, P669, DOI 10.1142/S0218001493000339
  • [2] Chen Y.-H., 2017, IEEE INT C COMP VIS
  • [3] What Makes Paris Look like Paris?
    Doersch, Carl
    Singh, Saurabh
    Gupta, Abhinav
    Sivic, Josef
    Efros, Alexei A.
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (04):
  • [4] Gebru T., 2017, P NATL ACAD SCI
  • [5] Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
  • [6] Completion of the 2011 National Land Cover Database for the Conterminous United States - Representing a Decade of Land Cover Change Information
    Homer, Collin
    Dewitz, Jon
    Yang, Limin
    Jin, Suming
    Danielson, Patrick
    Xian, George
    Coulston, John
    Herold, Nathaniel
    Wickham, James
    Megown, Kevin
    [J]. PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2015, 81 (05) : 345 - 354
  • [7] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
  • [8] TI-POOLING: transformation-invariant pooling for feature learning in Convolutional Neural Networks
    Laptev, Dmitry
    Savinov, Nikolay
    Buhmann, Joachim M.
    Pollefeys, Marc
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 289 - 297
  • [9] Toward Seamless Multiview Scene Analysis From Satellite to Street Level
    Lefevre, Sebastien
    Tuia, Devis
    Wegner, Jan Dirk
    Produit, Timothee
    Nassar, Ahmed Samy
    [J]. PROCEEDINGS OF THE IEEE, 2017, 105 (10) : 1884 - 1899
  • [10] Movshovitz-Attias Y, 2015, PROC CVPR IEEE, P1693, DOI 10.1109/CVPR.2015.7298778