Ensemble Learning for Relational Data

Cited by: 0
Authors
Eldardiry, Hoda [1]
Neville, Jennifer [2, 3]
Rossi, Ryan A. [4]
Affiliations
[1] Virginia Tech, Dept Comp Sci, 114 McBryde Hall, Blacksburg, VA 24061 USA
[2] Purdue Univ, Dept Comp Sci, 307 N Univ St, W Lafayette, IN 47907 USA
[3] Purdue Univ, Dept Stat, 307 N Univ St, W Lafayette, IN 47907 USA
[4] Adobe Res, 345 Pk Ave, San Jose, CA 95110 USA
Funding
National Science Foundation (USA);
Keywords
Ensemble learning; relational ensemble; collective classification; collective inference; bias-variance decomposition; relational machine learning; theoretical framework; VARIANCE; BIAS;
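DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract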
We present a theoretical analysis framework for relational ensemble models. We show that ensembles of collective classifiers can improve predictions for graph data by reducing errors due to variance in both learning and inference. In addition, we propose a relational ensemble framework that combines a relational ensemble learning approach with a relational ensemble inference approach for collective classification. The proposed ensemble techniques are applicable to both single-graph and multiple-graph settings. Experiments on both synthetic and real-world data demonstrate the effectiveness of the proposed framework. Finally, our experimental results support the theoretical analysis and confirm that ensemble algorithms that explicitly target both the learning and inference processes, and aim to reduce the errors associated with each, perform best.
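The variance-reduction idea in the abstract can be illustrated with a toy simulation. The sketch below is not the paper's relational framework: it assumes hypothetical base models whose predictions are the true value plus independent zero-mean Gaussian noise, and all names and parameters (`noisy_prediction`, `true_value`, `noise_sd`) are illustrative assumptions. Under that independence assumption, averaging `n_models` predictions should shrink the variance by roughly a factor of `n_models`.

```python
import random
import statistics


def noisy_prediction(rng, true_value=0.7, noise_sd=0.2):
    """One hypothetical base model: the true value plus zero-mean Gaussian noise."""
    return true_value + rng.gauss(0.0, noise_sd)


def ensemble_prediction(rng, n_models, true_value=0.7, noise_sd=0.2):
    """Average the outputs of n_models independent (assumed) base models."""
    outputs = [noisy_prediction(rng, true_value, noise_sd) for _ in range(n_models)]
    return statistics.fmean(outputs)


def prediction_variance(n_models, n_trials=5000, seed=0):
    """Estimate the variance of the ensemble prediction over repeated trials."""
    rng = random.Random(seed)
    predictions = [ensemble_prediction(rng, n_models) for _ in range(n_trials)]
    return statistics.pvariance(predictions)


single_var = prediction_variance(n_models=1)
ensemble_var = prediction_variance(n_models=10)
print(f"single model variance:  {single_var:.4f}")
print(f"10-model ensemble variance: {ensemble_var:.4f}")
```

In collective classification the base predictions are generally correlated through the graph, so the reduction would be smaller than this idealized independent case suggests; the sketch only shows the direction of the effect.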
Pages: 37