Skyline Preference Query Based on Massive and Incomplete Dataset

Cited by: 21
Authors
Wang, Yan [1 ]
Shi, Zhan [1 ]
Wang, Junlu [1 ]
Sun, Lingfeng [1 ]
Song, Baoyan [1 ]
Affiliations
[1] Liaoning Univ, Sch Informat, Shenyang 110036, Peoples R China
Source
IEEE ACCESS | 2017 / Vol. 5
Funding
National Natural Science Foundation of China;
Keywords
Internet of things; incomplete data processing; skyline query; information entropy;
DOI
10.1109/ACCESS.2016.2639558
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Personalized recommendation and real-time data processing are typical examples of massive-data processing, which has received considerable attention in recent Internet-of-Things (IoT) literature. Incompleteness is widespread in massive IoT data, and obtaining personalized information from an incomplete data set efficiently and accurately remains an open problem. The skyline query is a widely used data processing method, especially in multi-objective decision analysis and data visualization. To mitigate the negative effects of incompleteness on massive data processing in the IoT, this paper proposes a novel skyline preference query strategy for massive and incomplete data sets. The strategy divides the data set into two parts according to dimension importance and executes a skyline query on each part separately. It addresses the problem of extracting personalized information from massive and incomplete data sets and improves the efficiency of skyline queries on such data. First, a skyline preference query strategy based on strict clustering is presented and applied to the dimensions of higher importance. Second, a skyline preference query strategy based on loose clustering is applied to the dimensions of lower importance. Finally, the local skyline results are integrated, and the global skyline result is calculated using information entropy theory. The efficiency and effectiveness of the Skyline Preference Query (SPQ) algorithm are evaluated in terms of response time and result set size through comparative experiments against the ISkyline algorithm and a sort-based incomplete data skyline algorithm. Extensive simulation results show that the SPQ algorithm is more efficient than the other common methods.
Pages: 3183-3192
Page count: 10
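
The abstract names two building blocks, skyline queries over incomplete data and information entropy, without giving their definitions. The sketch below illustrates both in a generic form and is not the paper's SPQ algorithm. It assumes that smaller values are preferred, that missing values are encoded as None, and that dominance is tested only on dimensions observed in both points (a convention commonly used for incomplete-data skylines). The function names dominates, naive_skyline and shannon_entropy are hypothetical and chosen for illustration only.

```python
from collections import Counter
from math import log2
from typing import List, Optional, Sequence

Point = Sequence[Optional[float]]  # None marks a missing value (assumption)


def dominates(p: Point, q: Point) -> bool:
    """True if p dominates q on the dimensions observed in both points.

    Assumes smaller is better. Points with no commonly observed
    dimension are treated as incomparable.
    """
    strictly_better = False
    for a, b in zip(p, q):
        if a is None or b is None:
            continue  # skip dimensions missing in either point
        if a > b:
            return False  # p is worse somewhere, so it cannot dominate
        if a < b:
            strictly_better = True
    return strictly_better


def naive_skyline(points: List[Point]) -> List[Point]:
    """Return the points not dominated by any other point.

    All pairs are compared (O(n^2)) because dominance over incomplete
    data is not transitive, so pruning shortcuts are unsafe here.
    """
    return [
        p
        for i, p in enumerate(points)
        if not any(dominates(q, p) for j, q in enumerate(points) if j != i)
    ]


def shannon_entropy(values: Sequence[Optional[float]]) -> float:
    """Shannon entropy (in bits) of the observed values of one dimension."""
    observed = [v for v in values if v is not None]
    if not observed:
        return 0.0
    n = len(observed)
    counts = Counter(observed)
    return -sum((c / n) * log2(c / n) for c in counts.values())


if __name__ == "__main__":
    data = [(1, None, 3), (2, 2, None), (None, 1, 4), (3, 3, 5)]
    print(naive_skyline(data))
    print(shannon_entropy([row[0] for row in data]))
```

How SPQ clusters dimensions by importance and how it uses entropy to merge the local skylines is described in the paper itself; the entropy helper here only shows the standard quantity such a step would build on.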