Relation Extraction for Massive News Texts

被引:13
作者
Yin, Libo [1 ]
Meng, Xiang [2 ]
Li, Jianxun [3 ]
Sun, Jianguo [2 ]
机构
[1] China Ind Control Syst Cyber Emergency Response T, Beijing 100043, Peoples R China
[2] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin 150001, Peoples R China
[3] Harbin Inst Technol, Coll Comp Sci & Technol, Harbin 150006, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2019年 / 60卷 / 01期
基金
黑龙江省自然科学基金;
关键词
Entity relation extraction; relation classification; massive news texts;
D O I
10.32604/cmc.2019.05556
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of information technology including Internet technologies, the amount of textual information that people need to process daily is increasing. In order to automatically obtain valuable and user-informed information from massive amounts of textual data, many researchers have conducted in-depth research in the area of entity relation extraction. Based on the existing research of word vector and the method of entity relation extraction, this paper designs and implements an method based on support vector machine (SVM) for extracting English entity relationships from massive news texts. The method converts sentences in natural language into a form of numerical matrix that can be understood and processed by computers through word embedding and position embedding. Then the key features are extracted, and feature vectors are constructed and sent to the SVM classifiers for relation classification. In the process of feature extraction, we had two different models to finish the job, one by Principal Component Analysis (PCA) and the other by Convolutional Neural Networks (CNN). We designed experiments to evaluate the algorithm.
引用
收藏
页码:275 / 285
页数:11
相关论文
共 13 条
  • [1] [Anonymous], 2017, DEEP RESIDUAL LEARNI
  • [2] Feng XC, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P4002
  • [3] Glorot X., 2010, P 13 INT C ART INT S, P249
  • [4] Gormley Matthew R, 2015, P 2015 C EMP METH NA
  • [5] Lin YK, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P2124
  • [6] Liu S., 2017, P 4 BIOCREATIVE
  • [7] Mooney R. J., 2005, P INT C NEUR INF PRO
  • [8] Qu M., 2017, P WORLD WID WEB C, P1257
  • [9] Quirk C, 2017, 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, P1171
  • [10] Riedel S, 2010, LECT NOTES ARTIF INT, V6323, P148, DOI 10.1007/978-3-642-15939-8_10