Improving Semantic Relation Extraction System with Compositional Dependency Unit on Enriched Shortest Dependency Path

被引:0
作者
Duy-Cat Can [1 ]
Hoang-Quynh Le [1 ]
Quang-Thuy Ha [1 ]
机构
[1] Vietnam Natl Univ Hanoi, Univ Engn & Technol, Fac Informat Technol, Hanoi, Vietnam
来源
INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2019, PT I | 2019年 / 11431卷
关键词
Relation extraction; Dependency unit; Shortest dependency path; Convolutional neural network;
D O I
10.1007/978-3-030-14799-0_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Experimental performance on the task of relation extraction/classification has generally improved using deep neural network architectures. In which, data representation has been proven to be one of the most influential factors to the model's performance but still has many limitations. In this work, we take advantage of compressed information in the shortest dependency path (SDP) between two corresponding entities to classify the relation between them. We propose (i) a compositional embedding that combines several dominant linguistic as well as architectural features and (ii) dependency tree normalization techniques for generating rich representations for both words and dependency relations in the SDP. We also present a Convolutional Neural Network (CNN) model to process the proposed SDP enriched representation. Experimental results for both general and biomedical data demonstrate the effectiveness of compositional embedding, dependency tree normalization technique as well as the suitability of the CNN model.
引用
收藏
页码:140 / 152
页数:13
相关论文
共 20 条
  • [1] [Anonymous], 2010, P ICML 10 P 27 INT C
  • [2] Bojanowski Piotr, 2017, Trans. Assoc. Comput. Linguist., V5, P135, DOI DOI 10.1162/TACL_A_00051
  • [3] Bunescu R. C., 2005, P C HUM LANG TECHN E, P724, DOI DOI 10.3115/1220575.1220666
  • [4] Opportunities and obstacles for deep learning in biology and medicine
    Ching, Travers
    Himmelstein, Daniel S.
    Beaulieu-Jones, Brett K.
    Kalinin, Alexandr A.
    Do, Brian T.
    Way, Gregory P.
    Ferrero, Enrico
    Agapow, Paul-Michael
    Zietz, Michael
    Hoffman, Michael M.
    Xie, Wei
    Rosen, Gail L.
    Lengerich, Benjamin J.
    Israeli, Johnny
    Lanchantin, Jack
    Woloszynek, Stephen
    Carpenter, Anne E.
    Shrikumar, Avanti
    Xu, Jinbo
    Cofer, Evan M.
    Lavender, Christopher A.
    Turaga, Srinivas C.
    Alexandari, Amr M.
    Lu, Zhiyong
    Harris, David J.
    DeCaprio, Dave
    Qi, Yanjun
    Kundaje, Anshul
    Peng, Yifan
    Wiley, Laura K.
    Segler, Marwin H. S.
    Boca, Simina M.
    Swamidass, S. Joshua
    Huang, Austin
    Gitter, Anthony
    Greene, Casey S.
    [J]. JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2018, 15 (141)
  • [5] Hendrickx Iris, 2009, Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions, DEW '09, P94
  • [6] Sieve-based coreference resolution enhances semi-supervised learning model for chemical-induced disease relation extraction
    Hoang-Quynh Le
    Mai-Vu Tran
    Thanh Hai Dang
    Quang-Thuy Ha
    Collier, Nigel
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016,
  • [7] Kingma J., 2015, INT C LEARNING REPRE
  • [8] Le HQ, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P2266
  • [9] Gradient-based learning applied to document recognition
    Lecun, Y
    Bottou, L
    Bengio, Y
    Haffner, P
    [J]. PROCEEDINGS OF THE IEEE, 1998, 86 (11) : 2278 - 2324
  • [10] Liu Y, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, P285