Top-K representative documents query over geo-textual data stream

被引:0
作者
Bin Wang
Rui Zhu
Xiaochun Yang
Guoren Wang
机构
[1] Northeastern University,School of Computer Science and Engineering
来源
World Wide Web | 2018年 / 21卷
关键词
Documents; Geo-textual data stream; Top-k; ELM;
D O I
暂无
中图分类号
学科分类号
摘要
The increasing popularity of location-based social networks encourages more and more users to share their experiences. It deeply impacts the decision of customers when shopping, traveling, and so on. This paper studies the problem of top-K valuable documents query over geo-textual data stream. Many researchers have studied this problem. However, they do not consider the reliability of documents, where some unreliable documents may mislead customers to make improper decisions. In addition, they lack the ability to prune documents with low representativeness. In order to increase user satisfaction in recommendation systems, we propose a novel framework named PDS. It first employs an efficiently machine learning technique named ELM to prune unreliable documents, and then uses a novel index named Gℋ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$\mathcal {GH}$\end{document} to maintain documents. For one thing, this index maintains a group of pruning values to filter low quality documents. For another, it utilizes the unique property of sliding window to further enhance the PDS performance. Theoretical analysis and extensive experimental results demonstrate the effectiveness of the proposed algorithms.
引用
收藏
页码:537 / 555
页数:18
相关论文
共 50 条
[1]  
Bai M(2016)Discovering the k representative skyline over a sliding window IEEE Trans. Knowl Data Eng. 28 2041-2056
[2]  
Xin J(2016)Distr: A distributed method for the reachability query over large uncertain graphs IEEE Trans. Parallel Distrib. Syst. 27 3172-3185
[3]  
Wang G(2006)Extreme learning Machine: Theory and applications Neurocomputing 70 489-501
[4]  
Zhang L(2012)Extreme learning machine for regression and multiclass classification IEEE Trans. Syst. Man Cybern. 42 513-529
[5]  
Zimmermann R(2009)Online sequential fuzzy extreme learning machine for function approximation and classification problems IEEE Trans. Syst. Man Cybern. 39 1067-1072
[6]  
Ye Y(2016)Conflict-aware event-participant arrangement and its variant for online setting IEEE Trans. Knowl. Data Eng. 28 2281-2295
[7]  
Wu X(2016)Tracking frequent items over distributed probabilistic data World Wide Web 19 579-604
[8]  
Cheng Y(2016)Bottleneck-aware arrangement over event-based social networks: the max-min approach World Wide Web 19 1151-1177
[9]  
Ye Y(2016)Online minimum matching in real-time spatial data Experiments and analysis PVLDB 9 1053-1064
[10]  
Chen L(2014)Selectivity estimation on streaming spatio-textual data using local correlations PVLDB 8 101-112