KeEL: Knowledge enhanced entity linking in automatic biography construction

被引:2
作者
Tianlei, Zhang [1 ]
Xinyu, Zhang [1 ]
Mu, Guo [1 ]
机构
[1] Computer Science Department, Tsinghua University, 100084, Beijing
来源
Journal of China Universities of Posts and Telecommunications | 2014年 / 22卷 / 01期
基金
中国国家自然科学基金;
关键词
Biography construction; Entity linking; Knowledge; Knowledge enhanced entity linking; Markov logic network;
D O I
10.1016/S1005-8885(15)60625-2
中图分类号
O211 [概率论(几率论、或然率论)];
学科分类号
摘要
Biography is a direct and extensive way to know the representation of well known peoples, however, for common people, there is poor knowledge for them to be recognized. In recent years, information extraction (IE) technologies have been used to automatically generate biography for any people with online information. One of the key challenges is the entity linking (EL) which can link biography sentence to corresponding entities. Currently the used general EL systems usually generate errors originated from entity name variation and ambiguity. Compared with general text, biography sentences possess unique yet rarely studied relational knowledge (RK) and temporal knowledge (TK), which could sufficiently distinguish entities. This article proposed a new statistical framework called the knowledge enhanced EL (KeEL) system for automated biography construction. It utilizes commonsense knowledge like PK and TK to enhance Entity Linking. The performance of KeEL on Wikipedia data was evaluated. It is shown that, compared with state-of-the-art method, KeEL significantly improves the precision and recall of Entity Linking.
引用
收藏
页码:57 / 64
页数:7
相关论文
共 25 条
[1]  
Bunescu R., Pasca M., Using encyclopedic knowledge for named entity disambiguation, Proceedings of the 11st Conference of the European Chapter of the Association for Computational Linguistic (EACL'06), pp. 9-16, (2006)
[2]  
Cucerzan S., Large-scale named entity disambiguation based on Wikipedia data, Proceedings of the 2007 Joint Meeting of the Conference on Empirical Methods on Natural Language Processing (EMNLP'07) and the Conference on Natural Language Learning (CoNLL'07), pp. 708-716, (2007)
[3]  
Kulkarni S., Singh A., Ramakrishnan G., Et al., Collective annotation of Wikipedia entities in Web text, Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09), pp. 457-466, (2009)
[4]  
Han X.P., Sun L., Zhao J., Collective entity linking in Web text: A graph-based method, Proceedings of the 34th Annual International ACM SIGIR Conference (SIGIR'11), pp. 765-774, (2011)
[5]  
Dredze M., McNamee P., Rao D., Et al., Entity disambiguation for knowledge base population, Proceedings of the 23rd International Conference on Computational Linguistics (COLING'10), pp. 277-285, (2010)
[6]  
Mihalcea R., Csomai A., Wikify!: Linking documents to encyclopedic knowledge, Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM'07), pp. 233-242, (2007)
[7]  
Fader A., Soderland S., Etzioni O., Scaling Wikipedia-based named entity disambiguation to arbitrary Web text, Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI'09), pp. 21-26, (2009)
[8]  
Zhang W., Su J., Tan C.L., Et al., Entity linking leveraging: Automatically generated annotation, Proceedings of the 23rd International Conference on Computational Linguistics (COLING'10), pp. 1290-1298, (2010)
[9]  
Zheng Z.C., Li F.T., Huang M.L., Et al., Learning to link entities with knowledge base, Human Language Technologies: Proceedings of the 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT'10), pp. 483-491, (2010)
[10]  
Milne D., Witten I.H., Learning to link with Wikipedia, Proceedings of the 17th ACM Conference on Information and Knowledge Management (CIKM'08), pp. 509-518, (2008)