Relation Extraction from Chinese News Web Documents Based on Weakly Supervised Learning

被引:0
作者
Qiu, Jing [1 ]
Liao, Lejian [1 ]
Li, Peng [1 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China
来源
2009 INTERNATIONAL CONFERENCE ON INTELLIGENT NETWORKING AND COLLABORATIVE SYSTEMS (INCOS 2009) | 2009年
关键词
Relation extraction; Kernel method; Machine learning; Weakly supervised;
D O I
10.1109/INCOS.2009.14
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Extracting instances of a given target relation from a given Web page corpus seems to be the basic work to exploit nearly endless source of knowledge which provided by the World Wide Web. Supervised learning requires a large amount of labeled data, but the data labeling process can be expensive and time consuming. In this paper we present a kernel-based weakly supervised machine learning algorithm for relation extraction. It takes a small set of target relations as input. The goal is to automatically extract arbitrary binary relations from Web documents in the domain of football games. Bootstrapping is used to improve the performance of the system. We also compare the performances on different input example sizes. Experimental results show the effectiveness and benefits of our approach.
引用
收藏
页码:219 / 225
页数:7
相关论文
共 23 条
  • [1] Abney S, 2004, COMPUT LINGUIST, V30, P364
  • [2] Agichtein E., 2000, P 5 ACM C DIG LIBR S
  • [3] [Anonymous], 1995, P 33 ANN M ASS COMP
  • [4] [Anonymous], 1998, INT WORKSH WORLD WID
  • [5] [Anonymous], 2002, P 40 ANN M ASS COMP
  • [6] [Anonymous], P 43 ANN M ASS COMP
  • [7] Bunescu R. C., 2005, P EMNLP 2005 VANC BC
  • [8] Chinchor N., 1994, COMPUT LINGUIST, V19, P409
  • [9] Culotta A., 2004, P ACL 2004 BARC SPAI
  • [10] Feldman R., 2006, P C EMP METH NAT PRO