Natural language aggregate query over RDF data

被引:15
作者
Hu, Xin [1 ]
Dang, Depeng [1 ]
Yao, Yingting [1 ]
Ye, Luting [1 ]
机构
[1] Beijing Normal Univ, Coll Informat Sci & Technol, Beijing 100875, Peoples R China
基金
中国国家自然科学基金;
关键词
RDF; Question answering; Natural language; Aggregate query; INTERFACE; SEARCH;
D O I
10.1016/j.ins.2018.04.042
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Natural language question/answering over RDF (Resource Description Framework) data has received widespread attention. Although several studies can address a small number of aggregate queries, these studies have many restrictions (e.g., interactive information, controlled questions or query templates). Thus far, there has been no natural language querying mechanism that can process general aggregate queries over RDF data. Therefore, we propose a framework called NLAQ (Natural Language Aggregate Query). First, we propose a novel algorithm to automatically understand a user's query intention, which primarily contains semantic relations and aggregations. Second, to build a better bridge between the query intention and RDF data, we propose an extended paraphrase dictionary ED to obtain more candidate mappings for semantic relations, and we introduce a predicate-type adjacent set PT to filter out inappropriate candidate mapping combinations in semantic relations and basic graph patterns. Third, we design a suitable translation plan for each aggregate category and effectively distinguish whether an aggregate item is numeric, which will greatly affect the aggregate result. Finally, we conduct extensive experiments over real datasets (QALD benchmark and DBpedia). The experimental results demonstrate that our solution is effective. (C) 2018 Elsevier Inc. All rights reserved.
引用
收藏
页码:363 / 381
页数:19
相关论文
共 48 条
[1]  
Agarwal MK., 2016, EDBT, P149
[2]   Natural Language Interface to Relational Database (NLI-RDB) Through Object Relational Mapping (ORM) [J].
Alghamdi, Abdullah ;
Owda, Majdi ;
Crockett, Keeley .
ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS, 2017, 513 :449-464
[3]  
Amsterdamer Y, 2015, PROC VLDB ENDOW, V8, P1430
[4]   NL2CM: A Natural Language Interface to Crowd Mining [J].
Amsterdamer, Yael ;
Kukliansky, Anna ;
Milo, Tova .
SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, :1433-1438
[5]  
[Anonymous], 2012, P 21 INT C WORLD WID
[6]  
[Anonymous], P EDBT BORD FRANC
[7]  
[Anonymous], 2011, P 2011 C EMPIRICAL M
[8]  
Bais H., 2016, INFORM TECHNOLOGY OR, P1
[9]   K-Extractor: Automatic Knowledge Extraction for Hybrid Question Answering [J].
Balakrishna, Mithun ;
Werner, Steven ;
Tatu, Marta ;
Erekhinskaya, Tatiana ;
Moldovan, Dan .
2016 IEEE TENTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2016, :389-390
[10]   A general framework to resolve the MisMatch problem in XML keyword search [J].
Bao, Zhifeng ;
Zeng, Yong ;
Ling, Tok Wang ;
Zhang, Dongxiang ;
Li, Guoliang ;
Jagadish, H. V. .
VLDB JOURNAL, 2015, 24 (04) :493-518