Data warehouse native feature based OLAP querying with keywords

被引:0
作者
Xiao, Min [1 ,2 ]
Chen, Ling [1 ]
Xia, Hai-Yuan [3 ]
Chen, Gen-Cai [1 ]
机构
[1] College of Computer Science and Technology, Zhejiang University
[2] PLA 75733
[3] Department of State Security, Zhejiang Province
来源
Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science) | 2012年 / 46卷 / 06期
关键词
Data warehouse; Keyword; OLAP; Query;
D O I
10.3785/j.issn.1008-973X.2012.06.003
中图分类号
学科分类号
摘要
In order to support naive users to use on-line analytical processing (OLAP) tools, this work proposed a native feature of the dimension and its attributes for the multidimensional model of data warehouse based OLAP query method, which combined OLAP and search engine-alike methods. The method firstly created column based full-text index for dimension tables, secondly generated hit groups in terms of the keywords provided by the user, then constructed a candidate result set by joining these hit groups, and finally presented the ranked results to the user. Based on the features that a user cared much more about the aggregated data and the imbalance of the dimensions and their attributes, irrelative attributes and repeated column values were filtered in proposed method to relieve the negative effects on the ranking results. A weighted coefficient, named dimensional level coefficient (DLC) was also introduced to the text ranking arithmetic. Experiments were conducted on the FoodMart and AdventureWorks provided by Microsoft SQL Server to confirm how these factors influenced the hitting rate. The results indicated that the proposed method achieved a higher hitting rate than the keyword-driven analytical processing (KDAP) methods for the first candidate.
引用
收藏
页码:974 / 979+986
相关论文
共 15 条
[1]  
Bhalotia G., Nakhe C., Hulgeri A., Et al., Keyword searching and browsing in databases using banks, 18th International Conference on Data Engineering, (2002)
[2]  
Agrawal S., Chaudhuri S., Das G., Dbxplorer: A system for keyword-based search over relational databases, 18th International Conference on Data Engineering, (2002)
[3]  
Hristidis V., Papakonstantinou Y., Discover: Keyword search in relational databases, Proceedings of the 28th International Conference on Very Large Data Bases, (2002)
[4]  
Hristidis V., Gravano L., Papakonstantinou Y., Efficient ir-style keyword search over relational databases, Proceedings of the 29th International Conference on Very Large Data Bases, (2003)
[5]  
Balmin A., Hristidis V., Papakonstantinou Y., Objectrank: Authority-based keyword search in databases, Proceedings of the Thirtieth International Conference on Very Large Data Bases, 30, (2004)
[6]  
Ariyachandra T., Watson H., Key organizational factors in data warehouse architecture selection, Decision Support Systems, 49, 2, pp. 200-212, (2010)
[7]  
Sifer M., Lin J., Watanobe Y., Et al., Integrating keyword search with multiple dimension tree views over a summary corpus data cube, Proceedings of the 2010 International Conference on Management of Data, (2010)
[8]  
Wu P., Sismanis Y., Reinwald B., Towards keyword-driven analytical processing, Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, (2007)
[9]  
Apache lucene
[10]  
Lucene scoring