User Information Extraction in Big Data Environment

被引:0
作者
Wang, Kaiqiang [1 ]
Shi, Yijie [1 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing, Peoples R China
来源
PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC) | 2017年
关键词
information extraction; big data; relation extraction; CRF; Interactive Encyclopedia;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the era of large data, massive unstructured data contains a wealth of knowledge, and relying on artificial to find this knowledge is unrealistic, so we study the method of extracting attributes and attribute value automatically from unstructured text. We use the structured information box of the Chinese interactive encyclopedia to extract the relationship triples for generating the relationship knowledge base, and then use the relationship knowledge base for the back annotation. The sentence including the tuple is added to training corpus. This method avoids the manual annotation and solves the problem of insufficient training corpus effectively, which is proven by some experiments.
引用
收藏
页码:2315 / 2318
页数:4
相关论文
共 11 条
[1]  
Al Zama Faiyaz, 2012, AAAI, P387
[2]  
[Anonymous], 2008, KDD
[3]  
Benson E, 2011, P 49 ANN M ASS COMP, P389
[4]  
Ciot Morgane, P 2013 C EMP METH NA, P1136
[5]  
Craven M, 1999, Proc Int Conf Intell Syst Mol Biol, P77
[6]  
Go A, 2009, PROCESSING, V150
[7]  
Hoffmann Raphael, HLT 11 P 49 ANN M AS, V1, P541
[8]  
Li Jiwei., 2014, Proceedings ofthe 52nd Annual Meeting ofthe Association for Computational Linguistics, P165
[9]  
Mintz Mike, ACL 09 P JOINT C 47, V2, P1003
[10]   Weakly Supervised Extraction of Computer Security Events from Twitter [J].
Ritter, Alan ;
Wright, Evan ;
Casey, William ;
Mitchell, Tom .
PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015), 2015, :896-905