Relation Extraction from Chinese News Web Documents Based on Weakly Supervised Learning

被引：0

作者：

Qiu, Jing ^{[1
]}

Liao, Lejian ^{[1
]}

Li, Peng ^{[1
]}

机构：

[1] Beijing Inst Technol, Sch Comp Sci, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China

来源：

2009 INTERNATIONAL CONFERENCE ON INTELLIGENT NETWORKING AND COLLABORATIVE SYSTEMS (INCOS 2009) | 2009年

关键词：

Relation extraction; Kernel method; Machine learning; Weakly supervised;

D O I：

10.1109/INCOS.2009.14

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Extracting instances of a given target relation from a given Web page corpus seems to be the basic work to exploit nearly endless source of knowledge which provided by the World Wide Web. Supervised learning requires a large amount of labeled data, but the data labeling process can be expensive and time consuming. In this paper we present a kernel-based weakly supervised machine learning algorithm for relation extraction. It takes a small set of target relations as input. The goal is to automatically extract arbitrary binary relations from Web documents in the domain of football games. Bootstrapping is used to improve the performance of the system. We also compare the performances on different input example sizes. Experimental results show the effectiveness and benefits of our approach.

引用

页码：219 / 225

页数：7