Scalable aggregate keyword query over knowledge graph

被引:14
作者
Hu, Xin [1 ]
Duan, Jiangli [1 ,2 ]
Dang, Depeng [3 ]
机构
[1] Yangtze Normal Univ, Coll Big Data & Intelligent Engn, Chongqing, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Computat Intelligence, Chongqing, Peoples R China
[3] Beijing Normal Univ, Sch Artificial Intelligence, Beijing, Peoples R China
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2020年 / 107卷
基金
中国国家自然科学基金;
关键词
Knowledge graph; Question answering; Keyword search; Aggregation; SEARCH;
D O I
10.1016/j.future.2020.02.011
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Existing keyword query systems over knowledge graphs are easy to use and can produce interesting results. However, they cannot address even simple aggregate queries (i.e., a query that needs statistics such as COUNT, SUM, AVG, MAX, MIN, >, < and =), and the sizes of existing schema graphs grow exponentially with the growth of the number of types or predicates in the knowledge graph, so that they have low scalability for building SPARQL statements. Therefore, we propose a framework called SAKQ (scalable aggregate keyword query over knowledge graph) that enables users to pose aggregate queries using simple keywords. First, we propose a scalable schema graph (i.e., type-predicate graph) that consists of the relationships between types and predicates, which has a small data size and contains all information needed for building SPARQL statements. Second, based on the type-predicate graph, we propose two algorithms to build query graphs with aggregation and transform the query graphs into SPARQL statements with aggregation. Finally, the experimental results over the benchmark datasets demonstrate that SAKQ can answer various general aggregate keyword queries. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:588 / 600
页数:13
相关论文
共 57 条
[1]   Getting answers from semantic repositories: a keywords-based approach [J].
Abad Navarro, Francisco ;
Fernandez Breis, Jesualdo Tomas .
PROCESAMIENTO DEL LENGUAJE NATURAL, 2018, (61) :39-46
[2]  
[Anonymous], P 19 INT C EXT DAT T
[3]  
[Anonymous], 2010, IEEE DATA ENG B
[4]  
[Anonymous], 2013, THESIS
[5]  
[Anonymous], 2010, Warburg effect in chemosensitivity: Targeting lactate dehydrogenase-A re-sensitizes Taxol-resistant cancer cells to Taxol
[6]   Left Bit Right: For SPARQL Join Queries with OPTIONAL Patterns (Left-outer-joins) [J].
Atre, Medha .
SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, :1793-1808
[7]   Semantic similarity method for keyword query system on RDF [J].
Bae, Minho ;
Kang, Sanggil ;
Oh, Sangyoon .
NEUROCOMPUTING, 2014, 146 :264-275
[8]  
Bornea M. A., 2013, Proceedings of the 2013 international conference on Management of data, P121, DOI 10.1145/2463676.2463718
[9]   Discovering User Behavioral Features to Enhance Information Search on Big Data [J].
Cassavia, Nunziato ;
Masciari, Elio ;
Pulice, Chiara ;
Sacca, Domenico .
ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2017, 7 (02)
[10]   SearchLens: Composing and Capturing Complex User Interests for Exploratory Search [J].
Chang, Joseph Chee ;
Hahn, Nathan ;
Perer, Adam ;
Kittur, Aniket .
PROCEEDINGS OF IUI 2019, 2019, :498-509