Mining from distributed and abstracted data

Citations: 4
Authors
Zhang, Xiaofeng [1]
Cheung, William K. [2]
Ye, Yunming [1]
Affiliations
[1] Harbin Inst Technol, Dept Comp Sci & Technol, Shenzhen, Peoples R China
[2] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
Keywords
PRIVACY PRESERVING DATA; DATABASES; MIXTURE;
DOI
10.1002/widm.1182
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Discovering global knowledge from distributed data sources is challenging because of practical concerns such as bandwidth limitations and data privacy. By appropriately abstracting distributed data, various global data mining tasks can still be carried out on the basis of local data abstractions. This article reviews existing techniques for distributed, abstraction-based data mining. It then discusses open research challenges for mining tasks performed on distributed and abstracted data, describes how global data models (clustering and manifold discovery) can be learnt from local data models, and points out future research directions. WIREs Data Mining Knowl Discov 2016, 6:167-176. doi: 10.1002/widm.1182
Pages: 167-176
Number of pages: 10