A Classification Model for Road Traffic Incidents on Twitter Data

Cited by: 2
Authors
Raksachat, Thawatchai [1]
Chawuthai, Rathachai [1]
Affiliations
[1] King Mongkut's Institute of Technology Ladkrabang, School of Engineering, Bangkok, Thailand
Source
2022 37th International Technical Conference on Circuits/Systems, Computers and Communications (ITC-CSCC 2022), 2022
Keywords
Deep Learning; Imbalance Dataset; Twitter Data Analytics; Road Traffic Incident; Text Classification;
DOI
10.1109/ITC-CSCC55581.2022.9894853
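Chinese Library Classification (CLC) codes
TM [Electrical Engineering]; TN [Electronics and Communication Technology];
Discipline classification codes
0808; 0809;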
Abstract
This study aims to build a classification model for road traffic incidents in Thailand using Twitter data. The main challenge is handling a highly imbalanced dataset of 5 classes. In our survey of prior work, some studies addressed this issue with the Markov Chains method. However, Markov Chains performed poorly on our dataset, so we study Undersampling, Oversampling, Markov Chains, and Bi-directional Long Short-Term Memory (Bi-LSTM). With Markov Chains as the baseline, our experiments show that Bi-LSTM improves the F1-score by up to 15.44% over the baseline.
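The abstract names undersampling and oversampling as two of the remedies studied for class imbalance. As a minimal sketch of what random resampling generally looks like (not the authors' implementation), the snippet below balances a toy 5-class dataset. The class names, counts, and the `resample` helper are hypothetical and chosen only for illustration; the record does not list the paper's actual classes or sampling procedure.

```python
import random
from collections import Counter, defaultdict

def resample(texts, labels, mode="over", seed=0):
    """Randomly balance classes by duplicating ("over") or dropping ("under") examples."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for text, label in zip(texts, labels):
        by_class[label].append(text)
    sizes = [len(items) for items in by_class.values()]
    # Oversampling grows every class to the largest size;
    # undersampling shrinks every class to the smallest size.
    target = max(sizes) if mode == "over" else min(sizes)
    out_texts, out_labels = [], []
    for label, items in by_class.items():
        if mode == "over":
            # Keep every original example and draw extras with replacement.
            chosen = items + rng.choices(items, k=target - len(items))
        else:
            # Keep a random subset without replacement.
            chosen = rng.sample(items, target)
        out_texts.extend(chosen)
        out_labels.extend([label] * target)
    return out_texts, out_labels

# Hypothetical toy data: 5 classes with skewed counts (names are assumptions).
counts = {"accident": 50, "congestion": 20, "roadwork": 8, "flood": 4, "other": 2}
texts = [f"{name} tweet {i}" for name, n in counts.items() for i in range(n)]
labels = [name for name, n in counts.items() for _ in range(n)]

over_texts, over_labels = resample(texts, labels, mode="over")
under_texts, under_labels = resample(texts, labels, mode="under")
print(Counter(over_labels))
print(Counter(under_labels))
```

Oversampling keeps all original tweets but repeats minority examples, while undersampling discards majority examples; in practice either would be applied to the training split only, so that evaluation data keeps its natural class distribution.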
Pages: 442-445
Number of pages: 4