An influence assessment method based on co-occurrence for topologically reduced big data sets

被引:11
作者
Trovati, Marcello [1 ]
Bessis, Nik [1 ]
机构
[1] Univ Derby, Dept Comp & Math, Derby DE22 1GB, England
关键词
Knowledge discovery; Large-scale networks; Information extraction; Data analytics;
D O I
10.1007/s00500-015-1621-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The extraction of meaningful, accurate, and relevant information is at the core of Big Data research. Furthermore, the ability to obtain an insight is essential in any decision-making process, even though the diverse and complex nature of big data sets raises a multitude of challenges. In this paper, we propose a novel method to address the automated assessment of influence among concepts in big data sets. This is carried out by investigating their mutual co-occurrence, which is determined via topologically reducing the corresponding network. The main motivation is to provide a toolbox to classify and analyse influence properties, which can be used to investigate their dynamical and statistical behaviour, potentially leading to a better understanding and prediction of the properties of the system(s) they model. An evaluation was carried out on two real-world data sets, which were analysed to test the capabilities of our system. The results show the potential of our approach, indicating both accuracy and efficiency.
引用
收藏
页码:2021 / 2030
页数:10
相关论文
共 26 条
[1]   Statistical mechanics of complex networks [J].
Albert, R ;
Barabási, AL .
REVIEWS OF MODERN PHYSICS, 2002, 74 (01) :47-97
[2]  
[Anonymous], 2006, P 5 INT C LANGUAGE R
[3]  
[Anonymous], 1998, GRAD TEXT M
[4]  
[Anonymous], 2012, SENTIMENT ANAL OPINI
[5]  
[Anonymous], 1990, Introduction to Algorithms
[6]  
[Anonymous], 2010, P 3 ACM INT C WEB SE, DOI DOI 10.1145/1718487.1718520
[7]   Dimensionality reduction of medical big data using neural-fuzzy classifier [J].
Azar, Ahmad Taher ;
Hassanien, Aboul Ella .
SOFT COMPUTING, 2015, 19 (04) :1115-1127
[8]   Scale-free topology of e-mail networks [J].
Ebel, H ;
Mielsch, LI ;
Bornholdt, S .
PHYSICAL REVIEW E, 2002, 66 (03) :1-035103
[9]  
Engelhardt-Nowitzki C., 2011, 2011 International Conference on Emerging Intelligent Data and Web Technologies, P131, DOI 10.1109/EIDWT.2011.28
[10]  
Feldman R., 2006, TEXT MINING HDB