Knowledge-based Approach for Event Extraction from Arabic Tweets

被引:0
作者
AL-Smadi, Mohammad [1 ]
Qawasmeh, Omar [1 ]
机构
[1] Jordan Univ Sci & Technol, Dept Comp Sci, POB 3030, Irbid 22110, Jordan
关键词
Event Extraction; Knowledge base; Entity linking; Named entity disambiguation; Arabic NLP;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Tweets provide a continuous update on current events. However, Tweets are short, personalized and noisy, thus raises more challenges for event extraction and representation. Extracting events out of Arabic tweets is a new research domain where few examples - if any - of previous work can be found. This paper describes a knowledge-based approach for fostering event extraction out of Arabic tweets. The approach uses an unsupervised rule-based technique for event extraction and provides a named entity disambiguation of event related entities (i.e. person, organization, and location). Extracted events and their related entities are populated to the event knowledge base where tagged tweets' entities are linked to their corresponding entities represented in the knowledge base. Proposed approach was evaluated on a dataset of 1K Arabic tweets covering different types of events (i.e. instant events and interval events). Results show that the approach has an accuracy of, 75.9% for event trigger extraction, 87.5% for event time extraction, and 97.7% for event type identification.
引用
收藏
页码:483 / 490
页数:8
相关论文
共 36 条
[1]   Using NLP techniques for tagging events in Arabic text [J].
Abuleil, Saleem. .
19TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, VOL II, PROCEEDINGS, 2007, :440-443
[2]  
Ahn D., 2006, PROC WORKSHOP ANNOTA, P1
[3]  
Al-Smadi M., 2015, PROC 15 INT C KNOWLE
[4]  
Aliane H., 2013, RANLP, P25
[5]  
Allan J., 1998, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P37, DOI 10.1145/290941.290954
[6]   Arabic Event Detection in Social Media [J].
Alsaedi, Nasser ;
Burnap, Pete .
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT I, 2015, 9041 :384-401
[7]  
ALTHOBAITI M, 2014, ARANLP JAVA BASED LI
[8]  
[Anonymous], 2008, ACE AUTOMATIC CONTEN
[9]  
Auer S., 2007, DBPEDIA NUCL WEB OPE
[10]  
Becker H, 2011, ICWSM