Machine Learning and Social Media Harvesting for Wildfire Prevention

被引:0
作者
Laksito, Arif Dwi [1 ]
Kusrini, Kusrini [2 ]
Setyanto, Arief [2 ]
Johari, Muhammad Zuhdi Fikri [1 ]
Maruf, Zauvik Rizaldi [1 ]
Yuana, Kumara Ari [1 ]
Adninda, Gardyas Bidari [3 ]
Kartikakirana, Renindya Azizza [3 ]
Nucifera, Fitria [3 ]
Widayani, Wiwi [1 ]
Chandramouli, Krishna [4 ]
Ezquierdo, Ebroul [5 ]
机构
[1] Univ AMIKOM Yogyakarta, Fac Comp Sci, Yogyakarta, Indonesia
[2] Univ AMIKOM Yogyakarta, Magister Informat Engn, Yogyakarta, Indonesia
[3] Univ Amikom Yogyakarta, Fac Sci & Technol, Yogyakarta, Indonesia
[4] Venaka Treleaf GBR, London, England
[5] Queen Mary Univ London, Sch EECS, Multimedia & Vis Res Grp, London, England
来源
2023 IEEE 13TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION SYSTEMS, ICPRS | 2023年
关键词
Deep Learning; Machine Learning; Text Classification; Forest Fire; Social Media;
D O I
10.1109/ICPRS58416.2023.10179001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media can be exploited to ease or better understand many problems in science and technology. Social media sensing become one of the methods to get the update information from crowd. Text is the most unstructured information sources available on that media comes from their users. Taking useful information from the unstructured text however, still challenging. The conversion from unstructured data to a vector need to be carried out texts analysis. Machine learning offer an opportunity overcome the challenges. However, it needs massive amount of labelled data to train the machine learning. The research community pose the fact that the availability of the dataset for non - English languages are limited. Forest fire as one of topics discussed in many social media platforms in many languages including Indonesian language. This work fills the gap by providing a labelled dataset in forest fire topics. Moreover, an exercise of the dataset with some available machine learning techniques are compared. Due to the limited number of data rows, researchers perform data augmentation and observe the impact of the augmentation towards the classifier performance. According to our experiments, we found that Random oversampling (ROS), in general, improves the accuracy score for all models we employed. Bidirectional LSTM with ROS overperforms other classifier and achieves 91%, 91% and 90.7% of recall, precision and f1-score respectively.
引用
收藏
页数:6
相关论文
共 31 条
  • [1] Text augmentation using a graph-based approach and clonal selection algorithm
    Ahmed, Hadeer
    Traore, Issa
    Mamun, Mohammad
    Saad, Sherif
    [J]. MACHINE LEARNING WITH APPLICATIONS, 2023, 11
  • [2] Social media-based COVID-19 sentiment classification model using Bi-LSTM
    Arbane, Mohamed
    Benlamri, Rachid
    Brik, Youcef
    Alahmar, Ayman Diyab
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212
  • [3] Boulton C., 2016, P INT AAAI C WEB SOC, V10, P178
  • [4] Carley K. M., 2015, Twitter usage in Indonesia
  • [5] Bidirectional deep recurrent neural networks for process fault classification
    Chadha, Gavneet Singh
    Panambilly, Ambarish
    Schwung, Andreas
    Ding, Steven X.
    [J]. ISA TRANSACTIONS, 2020, 106 (106) : 330 - 342
  • [6] SMOTE: Synthetic minority over-sampling technique
    Chawla, Nitesh V.
    Bowyer, Kevin W.
    Hall, Lawrence O.
    Kegelmeyer, W. Philip
    [J]. 2002, American Association for Artificial Intelligence (16)
  • [7] Sensing Social Media to Forecast COVID-19 Cases
    Comito, Carmela
    [J]. 2022 27TH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (IEEE ISCC 2022), 2022,
  • [8] Deep Convolution Neural Network sharing for the multi-label images classification
    Coulibaly, Solemane
    Kamsu-Foguem, Bernard
    Kamissoko, Dantouma
    Traore, Daouda
    [J]. MACHINE LEARNING WITH APPLICATIONS, 2022, 10
  • [9] A System Analytics Framework for Detecting Infrastructure-Related Topics in Disasters Using Social Sensing
    Fan, Chao
    Mostafavi, Ali
    Gupta, Aayush
    Zhang, Cheng
    [J]. ADVANCED COMPUTING STRATEGIES FOR ENGINEERING, PT II, 2018, 10864 : 74 - 91
  • [10] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]