Detecting ongoing events using contextual word and sentence embeddings

被引:2
作者
Maisonnave, Mariano [1 ]
Delbianco, Fernando [1 ]
Tohme, Fernando [1 ]
Maguitman, Ana [1 ]
Milios, Evangelos [2 ]
机构
[1] Univ Nacl Sur, Bahia Blanca, Buenos Aires, Argentina
[2] Dalhousie Univ, Halifax, NS, Canada
关键词
Ongoing Event Detection; Information Extraction; Contextual embeddings; BERT; RNN; CNN; EXTRACTION;
D O I
10.1016/j.eswa.2022.118257
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces the Ongoing Event Detection (OED) task, which is a specific Event Detection task where the goal is to detect ongoing event mentions only, as opposed to historical, future, hypothetical, or other forms or events that are neither fresh nor current. Any application that needs to extract structured information about ongoing events from unstructured texts can take advantage of an OED system. The main contribution of this paper are the following: (1) it introduces the OED task along with a dataset manually labeled for the task; (2) it presents the design and implementation of an RNN model for the task that uses BERT embeddings to define contextual word and contextual sentence embeddings as attributes, which to the best of our knowledge were never used before for detecting ongoing events in news; (3) it presents an extensive empirical evaluation that includes (i) the exploration of different architectures and hyperparameters, (ii) an ablation test to study the impact of each attribute, and (iii) a comparison with a replication of a state-of-the-art model. The results offer several insights into the importance of contextual embeddings and indicate that the proposed approach is effective in the OED task, outperforming the baseline models.
引用
收藏
页数:13
相关论文
共 59 条
  • [1] A rule dynamics approach to event detection in Twitter with its application to sports and politics
    Adedoyin-Olowe, Mariam
    Gaber, Mohamed Medhat
    Dancausa, Carlos M.
    Stahl, Frederic
    Gomes, Joao Bartolo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2016, 55 : 351 - 360
  • [2] Ahn D., 2006, P WORKSH ANN REAS TI, P1, DOI DOI 10.3115/1629235.1629236
  • [3] I-TWEC: Interactive clustering tool for Twitter
    Arin, Inanc
    Erpam, Mert Kemal
    Saygin, Yucel
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 96 : 1 - 13
  • [4] Bojanowski P., 2017, Trans. Assoc. Comput. Linguistics, V5, P135, DOI [DOI 10.1162/TACLA00051, 10.1162/tacl_a_00051, DOI 10.1162/TACL_A_00051]
  • [5] Boros E, 2018, THESIS U PARIS SACLA
  • [6] Bronstein O, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, P372
  • [7] A novel filter feature selection method using rough set for short text data
    Cekik, Rasim
    Uysal, Alper Kursat
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 160
  • [8] Social event detection with retweeting behavior correlation
    Chen, Xi
    Zhou, Xiangmin
    Sellis, Timos
    Li, Xue
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 114 : 516 - 523
  • [9] Chen YB, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, P167
  • [10] Chieu HL, 2003, 41ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, P216