Quality-optimized predictive analytics

被引:6
作者
Anagnostopoulos, Christos [1 ]
机构
[1] Univ Glasgow, Sch Comp Sci, Glasgow G12 8QQ, Lanark, Scotland
关键词
Predictive analytics; Stochastic decision making; Optimal stopping theory; Contextual data streams; Quality of information; SENSOR; CLASSIFICATION; IMPUTATION; NETWORKS;
D O I
10.1007/s10489-016-0807-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
On-line statistical and machine learning analytic tasks over large-scale contextual data streams coming from e.g., wireless sensor networks, Internet of Things environments, have gained high popularity nowadays due to their significance in knowledge extraction, regression and classification tasks, and, more generally, in making sense from large-scale streaming data. The quality of the received contextual information, however, impacts predictive analytics tasks especially when dealing with uncertain data, outliers data, and data containing missing values. Low quality of received contextual data significantly spoils the progressive inference and on-line statistical reasoning tasks, thus, bias is introduced in the induced knowledge, e.g., classification and decision making. To alleviate such situation, which is not so rare in real time contextual information processing systems, we propose a progressive time-optimized data quality-aware mechanism, which attempts to deliver contextual information of high quality to predictive analytics engines by progressively introducing a certain controlled delay. Such a mechanism progressively delivers high quality data as much as possible, thus eliminating possible biases in knowledge extraction and predictive analysis tasks. We propose an analytical model for this mechanism and show the benefits stem from this approach through comprehensive experimental evaluation and comparative assessment with quality-unaware methods over real sensory multivariate contextual data.
引用
收藏
页码:1034 / 1046
页数:13
相关论文
共 39 条
[1]  
Abbott D., 2014, Applied predictive analytics: Principles and techniques for the professional data analyst, V1
[2]  
Anagnostopoulos C, 2013, IEEE 24 INT S PERS I, P8
[3]   Situational computing: An innovative architecture with imprecise reasoning [J].
Anagnostopoulos, C. B. ;
Ntarladimas, Y. ;
Hadjiefthymiades, S. .
JOURNAL OF SYSTEMS AND SOFTWARE, 2007, 80 (12) :1993-2014
[4]   Enhancing situation-aware systems through imprecise reasoning [J].
Anagnostopoulos, Christos ;
Hadjiefthymiades, Stathes .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2008, 7 (10) :1153-1168
[5]   Scaling Out Big Data Missing Value Imputations [J].
Anagnostopoulos, Christos ;
Triantafillou, Peter .
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, :651-660
[6]   Time-optimized user grouping in Location Based Services [J].
Anagnostopoulos, Christos ;
Hadjiefthymiades, Stathes ;
Kolomvatsos, Kostas .
COMPUTER NETWORKS, 2015, 81 :220-244
[7]   Intelligent Trajectory Classification for Improved Movement Prediction [J].
Anagnostopoulos, Christos ;
Hadjiefthymiades, Stathes .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2014, 44 (10) :1301-1314
[8]   Time-optimized contextual information forwarding in mobile sensor networks [J].
Anagnostopoulos, Christos .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2014, 74 (05) :2317-2332
[9]   Multivariate context collection in mobile sensor networks [J].
Anagnostopoulos, Christos ;
Hadjiefthymiades, Stathes .
COMPUTER NETWORKS, 2013, 57 (06) :1394-1407
[10]  
[Anonymous], 2007, Optimal Stopping Rules