A proposal for an approach to mapping susceptibility to landslides using natural language processing and machine learning

被引:12
|
作者
Rodrigues, Saulo Guilherme [1 ]
Silva, Maisa Mendonca [2 ]
Alencar, Marcelo Hazin [3 ]
机构
[1] Univ Fed Pernambuco, Ctr Acad Agreste CAA, Ave Marielle Franco S-N,Km 59, BR-55014900 Caruaru, PE, Brazil
[2] Univ Fed Pernambuco, Dept Engn Management, Ave Arquitetura Cidade Univ, BR-50740550 Recife, PE, Brazil
[3] Univ Fed Pernambuco UFPE, Res Grp Risk Assessment & Modelling Environm Asse, Recife, PE, Brazil
关键词
Mapping susceptibility; Natural language processing; Machine learning; ARTIFICIAL NEURAL-NETWORKS; DATA MINING TECHNIQUES; RANDOM-FOREST; SPATIAL PREDICTION; FLOOD SUSCEPTIBILITY; DECISION TREE; PERFORMANCE EVALUATION; LOGISTIC-REGRESSION; TEXT CLASSIFICATION; GENETIC ALGORITHM;
D O I
10.1007/s10346-021-01643-3
中图分类号
P5 [地质学];
学科分类号
0709 ; 081803 ;
摘要
Compiling an inventory is a fundamental step for carrying out assessments of landslide hazards. However, data in sufficient quantity and quality are not always available. Thus, this study puts forward an approach for drawing up a landslide inventory using textual data from telephone records, and for mapping hazards of landslides in an urban area. Forty thousand seven hundred ninety-two textual records and the naive Bayes algorithm were used to classify them, and these form the landslide inventory. After creating the inventory, the random forest algorithm with 12 conditioning variables was used to map landslide hazards. The text classification model obtained an accuracy of 0.8671 and a Kappa index of 0.8038. The hazard mapping model obtained accuracy of 0.9503 and an AUC (area under the curve)-ROC (receiver operating characteristics) of 0.9870. The results produced by the model were also compared with real landslides reported in news reports and were shown to be close to what had happened, thus demonstrating the ability of the proposed approach to predict landslides. Finally, the proposed approach can be used in simulation environments, thereby supporting strategic decision-making associated with hazard analysis.
引用
收藏
页码:2515 / 2529
页数:15
相关论文
共 50 条
  • [1] A proposal for an approach to mapping susceptibility to landslides using natural language processing and machine learning
    Saulo Guilherme Rodrigues
    Maisa Mendonça Silva
    Marcelo Hazin Alencar
    Landslides, 2021, 18 : 2515 - 2529
  • [2] Stress detection using natural language processing and machine learning over social interactions
    Nijhawan, Tanya
    Attigeri, Girija
    Ananthakrishna, T.
    JOURNAL OF BIG DATA, 2022, 9 (01)
  • [3] Stress detection using natural language processing and machine learning over social interactions
    Tanya Nijhawan
    Girija Attigeri
    T. Ananthakrishna
    Journal of Big Data, 9
  • [4] Landslide Susceptibility Mapping in a Mountainous Area Using Machine Learning Algorithms
    Shahabi, Himan
    Ahmadi, Reza
    Alizadeh, Mohsen
    Hashim, Mazlan
    Al-Ansari, Nadhir
    Shirzadi, Ataollah
    Wolf, Isabelle D.
    Ariffin, Effi Helmy
    REMOTE SENSING, 2023, 15 (12)
  • [5] Automated Genre Classification of Books Using Machine Learning and Natural Language Processing
    Gupta, Shikha
    Agarwal, Mohit
    Jain, Satbir
    2019 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2019), 2019, : 269 - 272
  • [6] Resume Classification System using Natural Language Processing and Machine Learning Techniques
    Ali, Irfan
    Mughal, Nimra
    Khand, Zahid Hussain
    Ahmed, Javed
    Mujtaba, Ghulam
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2022, 41 (01) : 65 - 79
  • [7] Using Natural Language Processing and Machine Learning to Replace Human Content Coders
    Wang, Yilei
    Tian, Jingyuan
    Yazar, Yagizhan
    Ones, Deniz S.
    Landers, Richard N.
    PSYCHOLOGICAL METHODS, 2022,
  • [8] Spatial prediction and mapping of landslide susceptibility using machine learning models
    Chen, Yu
    NATURAL HAZARDS, 2025,
  • [9] A Comparison of Natural Language Processing and Machine Learning Methods for Phishing Email Detection
    Bountakas, Panagiotis
    Koutroumpouchos, Konstantinos
    Xenakis, Christos
    ARES 2021: 16TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY, 2021,
  • [10] Landslide Susceptibility Mapping using Machine Learning Algorithm
    Hussain, Muhammad Afaq
    Chen, Zhanlong
    Wang, Run
    Shah, Safeer Ullah
    Shoaib, Muhammad
    Ali, Nafees
    Xu, Daozhu
    Ma, Chao
    CIVIL ENGINEERING JOURNAL-TEHRAN, 2022, 8 (02): : 209 - 224