A Text Mining Approach to Discover Real-Time Transit Events from Twitter

被引:2
作者
Arias Zhanay, Belen [1 ]
Orellana Cordero, Gerardo [1 ]
Orellana Cordero, Marcos [1 ]
Acosta Uriguen, Maria-Ines [1 ]
机构
[1] Univ Azuay, Ave 24 Mayo 7-77, Cuenca, Ecuador
来源
INFORMATION AND COMMUNICATION TECHNOLOGIES OF ECUADOR (TIC.EC) | 2019年 / 884卷
关键词
Text mining; Transit; Traffic; Twitter; Real-time analysis;
D O I
10.1007/978-3-030-02828-2_12
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The accelerated growth of the number of inhabitants in the cities brings with it the increase in the number of means of transport, generating new conflicts related to traffic and mobility. This growth and lack of alternative public transportation create a scenario where traffic becomes a serious problem. Such is the case in Cuenca, a city located in Ecuador with a population growth of 15% in the last 7 years, and so has the number of cars. Moreover, transit information is only delivered by traditional media which is not always accurate or in real-time. It is imperative to create a system to discover real-time events to help the population to acquire precise information. With the arising of social networks such as Twitter, new opportunities to solve the transit problem at its origin. Twitter users interact with the social network every day and inform their fellow users of different topics such as transit. We take Twitter as a source of information to feed a real-time system which infers transit data from tweets. We create a predictive model with the use of pre-processing techniques for data cleaning, Support Vector Machines for predictive modeling, dictionaries and Levenshtein distance for location discovery, and finally, association analysis for data pattern finding. Our results show that our approach outperforms the existing works in the field. Furthermore, we have achieved accuracy values greater than 90% in classification subroutines and more than 70% in location discovery. Thus, we have settled a successful prediction model to implement real-time transit discovery in Twitter.
引用
收藏
页码:155 / 169
页数:15
相关论文
共 28 条
[1]  
[Anonymous], 2015, ICONFERENCE 2015 P
[2]  
[Anonymous], 2000, CRISPDM 1 0 STEP BY
[3]  
[Anonymous], ANALISIS COMPORTAMIE
[4]  
[Anonymous], 2016, INT J APPL ENG RES
[5]   Forecasting with Twitter Data [J].
Arias, Marta ;
Arratia, Argimiro ;
Xuriguera, Ramon .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2013, 5 (01)
[6]  
Banados JA, 2014, 5TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS, IISA 2014, P75, DOI 10.1109/IISA.2014.6878768
[7]   Real-time monitoring of Twitter traffic by using semantic networks [J].
Bisio, Federica ;
Meda, Claudia ;
Zunino, Rodolfo ;
Surlinelli, Roberto ;
Scillia, Eugenio ;
Ottaviano, Augusto .
PROCEEDINGS OF THE 2015 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2015), 2015, :966-969
[8]  
Borsdorf A., 2015, The Andes: A geographical portrait
[9]   Web-Based Traffic Sentiment Analysis: Methods and Applications [J].
Cao, Jianping ;
Zeng, Ke ;
Wang, Hui ;
Cheng, Jiajun ;
Qiao, Fengcai ;
Wen, Ding ;
Gao, Yanqing .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2014, 15 (02) :844-853
[10]   Choosing multiple parameters for support vector machines [J].
Chapelle, O ;
Vapnik, V ;
Bousquet, O ;
Mukherjee, S .
MACHINE LEARNING, 2002, 46 (1-3) :131-159