Learning Link-Based Naive Bayes Classifiers from Ontology-Extended Distributed Data

被引:0
作者
Caragea, Cornelia [1 ]
Caragea, Doina [2 ]
Honavar, Vasant [1 ]
机构
[1] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
[2] Kansas State Univ, Comp & Informat Sci, Manhattan, KS 66506 USA
来源
ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2009, PT 2 | 2009年 / 5871卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We address the problem of learning predictive models from multiple large, distributed, autonomous, and hence almost invariably semantically disparate, relational data sources from a user's point of view. We show under fairly general assumptions, how to exploit; data sources annotated with relevant meta data in building predictive models (e.g., classifiers) from a collection of distributed relational data sources, without the need for a centralized data warehouse, while offering strong guarantees of exactness of the learned classifiers relative to their centralized relational learning counterparts. We demonstrate an application of the proposed approach in the case of learning link-based Naive Bayes classifiers and present results of experiments on a text classification task that demonstrate the feasibility of the proposed approach.
引用
收藏
页码:1139 / +
页数:2
相关论文
共 20 条
[1]  
[Anonymous], 1997, MACHINE LEARNING, MCGRAW-HILL SCIENCE/ENGINEERING/MATH
[2]  
Calvanese D, 2005, LECT NOTES COMPUT SC, V3363, P321
[3]  
CARAGEA C, 2009, P ICTAI 2009 NEW NEW
[4]  
CARAGEA D, 2005, LNCS, P13
[5]  
Caragea D., 2008, ENCY DATABASE TECHNO
[6]  
Caragea D., 2007, P IJCAI 2007 SWECKA
[7]  
Doan A, 2005, AI MAG, V26, P83
[8]  
DOAN AH, 2003, VLDB J
[9]  
Getoor L., 2002, Journal of Machine Learning Research, V3, P679
[10]  
KALFOGLOU Y, 2005, P SEM INT INT DAGST