A Scheme of Automated Object and Facet Extraction for Faceted Search over XML Data

被引:2
作者
Komamizu, Takahiro [1 ]
Amagasa, Toshiyuki [2 ]
Kitagawa, Hiroyuki [2 ]
机构
[1] Univ Tsukuba, Grad Sch Syst & Informat Engn, Tsukuba, Ibaraki, Japan
[2] Univ Tsukuba, Fac Engn Informat & Syst, Tsukuba, Ibaraki, Japan
来源
PROCEEDINGS OF THE 18TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM (IDEAS14) | 2014年
关键词
XML search; faceted search; automatic extraction;
D O I
10.1145/2628194.2628241
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Applying faceted search for XML data enables users to search XML data in an interactive manner. However, applying faceted search is challenging, because faceted search requires target subtrees (objects) and facets to be defined beforehand. To this problem, existing works assume that such objects and/or facets are defined manually, but it is infeasible to manually specify objects and facets in particular when the XML data are huge and/or its structure is quite complicated. To address this problem, this paper proposes an automatic extraction scheme of objects and facets from XML data. We propose two approaches, namely frequency-based approach and semantic-based approach, and also hybrid approach of them. The basic ideas of these approaches are that the frequently occurring XML elements seem to be objects and facets, and such XML elements may have semantically meaningful name. Although the proposed approaches are rather simple, the experiments using real world XML data show that the proposed approaches can automatically extract objects and facets from the XML data.
引用
收藏
页码:338 / 341
页数:4
相关论文
共 11 条
[1]  
[Anonymous], P ACM SIGMOD INT C M
[2]   Inference of Concise Regular Expressions and DTDs [J].
Bex, Geert Jan ;
Neven, Frank ;
Schwentick, Thomas ;
Vansummeren, Stijn .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2010, 35 (02)
[3]  
Fellbaum Christiane, 2005, Encyclopedia of Language & Linguistics, V2, P665
[4]  
Goldman R, 1997, PROCEEDINGS OF THE TWENTY-THIRD INTERNATIONAL CONFERENCE ON VERY LARGE DATABASES, P436
[5]  
Harispe S., 2013, CoRR
[6]  
Komamizu T., 2011, P IIWAS 2011, P28
[7]  
Lin D., 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P296
[8]  
Liu Z., 2007, SIGMOD Conference, P329
[9]  
Marwick A., 2008, FACETED NAVIGATION D
[10]  
Tunkelang D., 2009, SYNTHESIS LECT INFOR