Can linguistic features extracted from geo-referenced tweets help building function classification in remote sensing?

被引：18

作者：

Haberle, Matthias ^{[1
,2
]}

Hoffmann, Eike Jens ^{[1
]}

Zhu, Xiao Xiang ^{[1
,2
]}

机构：

[1] Tech Univ Munich TUM, Data Sci Earth Observat SiPEO, Arcisstr 21, D-80333 Munich, Germany

[2] German Aerosp Ctr DLR, EO Data Sci, Munchener Str 20, D-82234 Wessling, Germany

来源：

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING | 2022年 / 188卷

基金：

欧洲研究理事会;

关键词：

Remote sensing; Decision fusion; Building function classification; Deep learning; Natural language processing; Word embedding; URBAN LAND-COVER; EARTH OBSERVATION; WORD EMBEDDINGS; DATA FUSION; INFORMATION; TWITTER; CITIZENS; SPACE;

D O I：

10.1016/j.isprsjprs.2022.04.006

中图分类号：

P9 [自然地理学];

学科分类号：

0705 ; 070501 ;

摘要：

The fusion of two or more different data sources is a widely accepted technique in remote sensing while becoming increasingly important due to the availability of big Earth Observation satellite data. As a complementary source of geo-information to satellite data, massive text messages from social media form a temporally quasi-seamless, spatially multi-perspective stream, but with unknown and diverse quality. Despite the uncontrolled quality: can linguistic features extracted from geo-referenced tweets support remote sensing tasks? This work presents a straightforward decision fusion framework for very high-resolution remote sensing images and Twitter text messages. We apply our proposed fusion framework to a land-use classification task - the building function classification task - in which we classify building functions like commercial or residential based on linguistic features derived from tweets and remote sensing images. Using building tags from OpenStreetMap (OSM), we labeled tweets and very high-resolution (VHR) images from Google Maps. We collected English tweets from San Francisco, New York City, Los Angeles, and Washington D.C. and trained a stacked bi-directional LSTM neural network with these tweets. For the aerial images, we predicted building functions with state-of-the-art Convolutional Neural Network (CNN) architectures fine-tuned from ImageNet on the given task. After predicting each modality separately, we combined the prediction probabilities of both models building-wise at a decision level. We show that the proposed fusion framework can improve the classification results of the building type classification task. To the best of our knowledge, we are the first to use semantic contents of Twitter messages and fusing them with remote sensing images to classify building functions at a single building level.

引用

页码：255 / 268

页数：14

共 86 条

[1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2] Using Convolutional Networks and Satellite Imagery to Identify Patterns in Urban Environments at a Large Scale
Albert, Adrian
Kaur, Jasleen
Gonzalez, Marta C.
[J]. KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1357 - 1366
[3] Transportation sentiment analysis using word embedding and ontology-based topic modeling
Ali, Farman
Kwak, Daehan
Khan, Pervez
El-Sappagh, Shaker
Ali, Amjad
Ullah, Sana
Kim, Kye Hyun
Kwak, Kyung-Sup
[J]. KNOWLEDGE-BASED SYSTEMS, 2019, 174 : 27 - 42
[4] A SURVEY OF TECHNIQUES FOR EVENT DETECTION IN TWITTER
Atefeh, Farzindar
Khreich, Wael
[J]. COMPUTATIONAL INTELLIGENCE, 2015, 31 (01) : 132 - 164
[5] Los Angeles as a digital place: The geographies of user-generated content
Ballatore, Andrea
De Sabbata, Stefano
[J]. TRANSACTIONS IN GIS, 2020, 24 (04) : 880 - 902
[6] Understanding heterogeneity in metropolitan India: The added value of remote sensing data for analyzing sub-standard residential areas
Baud, Isa
Kuffer, Monika
Pfeffer, Karin
Sliuzas, Richard
Karuppannan, Sadasivam
[J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2010, 12 (05) : 359 - 374
[7] A neural probabilistic language model
Bengio, Y
Ducharme, R
Vincent, P
Jauvin, C
[J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) : 1137 - 1155
[8] Latent Dirichlet allocation
Blei, DM
Ng, AY
Jordan, MI
[J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
[9] Bojanowski P., 2017, Trans. Assoc. Comput. Linguistics, V5, P135, DOI [DOI 10.1162/TACLA00051, 10.1162/tacl_a_00051, DOI 10.1162/TACL_A_00051]
[10] Race, religion and the city: twitter word frequency patterns reveal dominant demographic dimensions in the United States
Bokanyi, Eszter
Kondor, Daniel
Dobos, Laszlo
Sebok, Tamas
Steger, Jozsef
Csabai, Istvan
Vattay, Gabor
[J]. PALGRAVE COMMUNICATIONS, 2016, 2

← 1 2 3 4 5 6 7 8 9 →