An Event Extraction Model based on Timeline and User Analysis in Latent Dirichlet Allocation

被引:0
作者
Tsolmon, Bayar [1 ]
Lee, Kyung Soon [2 ]
机构
[1] Chonbuk Natl Univ, Div Comp Sci & Engn, Jeonju, South Korea
[2] CAIIT Chonbuk Natl Univ, Div Comp Sci & Engn, Jeonju, South Korea
来源
SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL | 2014年
基金
新加坡国家研究基金会;
关键词
Event Extraction; Timeline Analysis; User behaviors; Latent Dirichlet Allocation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social media such as Twitter has come to reflect the reaction of the general public to major events. Since posts are short and noisy, it is hard to extract reliable events based on word frequency. Even though an event term appears in a particularly low frequency, as long as at least one reliable user mentions the term, it should be extracted. This paper proposes an event extraction method which combines user reliability and timeline analysis. The Latent Dirichlet Allocation (LDA) topic model is adapted with the weights of event terms on timeline and reliable users to extract social events. The reliable users are detected on Twitter according to their tweeting behaviors: socially well-known users and active users. Reliable and low-frequency events can be detected based on reliable users In order to see the effectiveness of the proposed method, experiments are conducted on a Korean tweet collection; the proposed model achieved 72% in precision. This shows that the LDA with timeline and reliable users is effective for extracting events on the Twitter test collection.
引用
收藏
页码:1187 / 1190
页数:4
相关论文
共 9 条
  • [1] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [2] Diao Q., 2012, P 50 ANN M ASS COMP, V1, P536
  • [3] Kanhabua N., 2013, P 22 INT C WORLD WID, P1335
  • [4] Authoritative sources in a hyperlinked environment
    Kleinberg, JM
    [J]. JOURNAL OF THE ACM, 1999, 46 (05) : 604 - 632
  • [5] Nowcasting Events from the Social Web with Statistical Learning
    Lampos, Vasileios
    Cristianini, Nello
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2012, 3 (04)
  • [6] Popescu AM, 2010, P 19 ACM INT C INF K, P1873
  • [7] Sayyadi Hassan, 2009, ICWSM, P311
  • [8] Tinati Ramine, 2012, WWW, P1161, DOI DOI 10.1145/2187980.2188256
  • [9] Tsolmon Bayar, 2012, Natural Language Processing and Information Systems. Proceedings 17th International Conference on Applications of Natural Language to Information Systems, NLDB 2012, P265, DOI 10.1007/978-3-642-31178-9_32