Efficient pattern matching with periodical wildcards in uncertain sequences

被引:10
作者
Liu, Huiting [1 ,2 ]
Wang, Lili [1 ,2 ]
Liu, Zhizhong [1 ,2 ]
Zhao, Peng [1 ,2 ]
Wu, Xindong [3 ]
机构
[1] Anhui Univ, Minist Educ, Key Lab Intelligent Comp & Signal Proc, Hefei 230039, Anhui, Peoples R China
[2] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Anhui, Peoples R China
[3] Univ Louisiana Lafayette, Sch Comp & Informat, Lafayette, LA 70504 USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Pattern matching; substring matching; wildcards; uncertain sequences; SEQUENTIAL PATTERNS; DATABASES;
D O I
10.3233/IDA-173435
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data uncertainty is inherent in many real-world applications such as sensor data monitoring and mobile tracking. Mining sequential patterns from uncertain/inaccurate data, such as sensor readings and GPS trajectories, is important to discover hidden knowledge in such applications. This paper addresses the problem of pattern matching with periodical wildcards for uncertain sequences. We present a dynamic programming approach, called CoDP, to compute the exact probability that a pattern q is a subsequence of an uncertain sequence s, and this approach can be further applied to substring matching for uncertain sequences. The efficiency and effectiveness of our algorithm have been verified through extensive experiments on both real and synthetic data.
引用
收藏
页码:829 / 842
页数:14
相关论文
共 22 条
[1]  
Aggarwal CC, 2009, KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P29
[2]   A Survey of Uncertain Data Algorithms and Applications [J].
Aggarwal, Charu C. ;
Yu, Philip S. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (05) :609-623
[3]  
[Anonymous], 2010, SIGMOD Conference
[4]  
Bryne JC, 2008, NUCL ACIDS RES
[5]   Efficient query evaluation on probabilistic databases [J].
Dalvi, Nilesh ;
Suciu, Dan .
VLDB JOURNAL, 2007, 16 (04) :523-544
[6]  
Deshpande A., 2004, VLDB, P588, DOI DOI 10.1016/B978-012088469-8.50053-X
[7]   Sequential pattern mining in databases with temporal uncertainty [J].
Ge, Jiaqi ;
Xia, Yuni ;
Wang, Jian ;
Nadungodage, Chandima Hewa ;
Prabhakar, Sunil .
KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (03) :821-850
[8]  
Ge TJ, 2011, PROC VLDB ENDOW, V4, P772
[9]   Pattern matching with wildcards and gap-length constraints based on a centrality-degree graph [J].
Guo, Dan ;
Hu, Xuegang ;
Xie, Fei ;
Wu, Xindong .
APPLIED INTELLIGENCE, 2013, 39 (01) :57-74
[10]   A new efficient approach for mining uncertain frequent patterns using minimum data structure without false positives [J].
Lee, Gangin ;
Yun, Unil .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 68 :89-110