Can linguistic features extracted from geo-referenced tweets help building function classification in remote sensing?

被引:18
作者
Haberle, Matthias [1 ,2 ]
Hoffmann, Eike Jens [1 ]
Zhu, Xiao Xiang [1 ,2 ]
机构
[1] Tech Univ Munich TUM, Data Sci Earth Observat SiPEO, Arcisstr 21, D-80333 Munich, Germany
[2] German Aerosp Ctr DLR, EO Data Sci, Munchener Str 20, D-82234 Wessling, Germany
基金
欧洲研究理事会;
关键词
Remote sensing; Decision fusion; Building function classification; Deep learning; Natural language processing; Word embedding; URBAN LAND-COVER; EARTH OBSERVATION; WORD EMBEDDINGS; DATA FUSION; INFORMATION; TWITTER; CITIZENS; SPACE;
D O I
10.1016/j.isprsjprs.2022.04.006
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
The fusion of two or more different data sources is a widely accepted technique in remote sensing while becoming increasingly important due to the availability of big Earth Observation satellite data. As a complementary source of geo-information to satellite data, massive text messages from social media form a temporally quasi-seamless, spatially multi-perspective stream, but with unknown and diverse quality. Despite the uncontrolled quality: can linguistic features extracted from geo-referenced tweets support remote sensing tasks? This work presents a straightforward decision fusion framework for very high-resolution remote sensing images and Twitter text messages. We apply our proposed fusion framework to a land-use classification task - the building function classification task - in which we classify building functions like commercial or residential based on linguistic features derived from tweets and remote sensing images. Using building tags from OpenStreetMap (OSM), we labeled tweets and very high-resolution (VHR) images from Google Maps. We collected English tweets from San Francisco, New York City, Los Angeles, and Washington D.C. and trained a stacked bi-directional LSTM neural network with these tweets. For the aerial images, we predicted building functions with state-of-the-art Convolutional Neural Network (CNN) architectures fine-tuned from ImageNet on the given task. After predicting each modality separately, we combined the prediction probabilities of both models building-wise at a decision level. We show that the proposed fusion framework can improve the classification results of the building type classification task. To the best of our knowledge, we are the first to use semantic contents of Twitter messages and fusing them with remote sensing images to classify building functions at a single building level.
引用
收藏
页码:255 / 268
页数:14
相关论文
共 86 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] Using Convolutional Networks and Satellite Imagery to Identify Patterns in Urban Environments at a Large Scale
    Albert, Adrian
    Kaur, Jasleen
    Gonzalez, Marta C.
    [J]. KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1357 - 1366
  • [3] Transportation sentiment analysis using word embedding and ontology-based topic modeling
    Ali, Farman
    Kwak, Daehan
    Khan, Pervez
    El-Sappagh, Shaker
    Ali, Amjad
    Ullah, Sana
    Kim, Kye Hyun
    Kwak, Kyung-Sup
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 174 : 27 - 42
  • [4] A SURVEY OF TECHNIQUES FOR EVENT DETECTION IN TWITTER
    Atefeh, Farzindar
    Khreich, Wael
    [J]. COMPUTATIONAL INTELLIGENCE, 2015, 31 (01) : 132 - 164
  • [5] Los Angeles as a digital place: The geographies of user-generated content
    Ballatore, Andrea
    De Sabbata, Stefano
    [J]. TRANSACTIONS IN GIS, 2020, 24 (04) : 880 - 902
  • [6] Understanding heterogeneity in metropolitan India: The added value of remote sensing data for analyzing sub-standard residential areas
    Baud, Isa
    Kuffer, Monika
    Pfeffer, Karin
    Sliuzas, Richard
    Karuppannan, Sadasivam
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2010, 12 (05) : 359 - 374
  • [7] A neural probabilistic language model
    Bengio, Y
    Ducharme, R
    Vincent, P
    Jauvin, C
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) : 1137 - 1155
  • [8] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [9] Bojanowski P., 2017, Trans. Assoc. Comput. Linguistics, V5, P135, DOI [DOI 10.1162/TACLA00051, 10.1162/tacl_a_00051, DOI 10.1162/TACL_A_00051]
  • [10] Race, religion and the city: twitter word frequency patterns reveal dominant demographic dimensions in the United States
    Bokanyi, Eszter
    Kondor, Daniel
    Dobos, Laszlo
    Sebok, Tamas
    Steger, Jozsef
    Csabai, Istvan
    Vattay, Gabor
    [J]. PALGRAVE COMMUNICATIONS, 2016, 2