Advanced Semantics for Commonsense Knowledge Extraction

被引:21
作者
Nguyen, Tuan-Phong [1 ]
Razniewski, Simon [1 ]
Weikum, Gerhard [1 ]
机构
[1] Max Planck Inst Informat, Saarbrucken, Germany
来源
PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021) | 2021年
关键词
D O I
10.1145/3442381.3449827
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Commonsense knowledge (CSK) about concepts and their properties is useful for AI applications such as robust chatbots. Prior works like ConceptNet, TupleKB and others compiled large CSK collections, but are restricted in their expressiveness to subject-predicate-object (SPO) triples with simple concepts for S and monolithic strings for P and O. Also, these projects have either prioritized precision or recall, but hardly reconcile these complementary goals. This paper presents a methodology, called Ascent, to automatically build a large-scale knowledge base (KB) of CSK assertions, with advanced expressiveness and both better precision and recall than prior works. Ascent goes beyond triples by capturing composite concepts with subgroups and aspects, and by refining assertions with semantic facets. The latter are important to express temporal and spatial validity of assertions and further qualifiers. Ascent combines open information extraction with judicious cleaning using language models. Intrinsic evaluation shows the superior size and quality of the Ascent KB, and an extrinsic evaluation for QA-support tasks underlines the benefits of Ascent. A web interface, data and code can be found at https://www.mpi-inf.mpg.de/ascent.
引用
收藏
页码:2636 / 2647
页数:12
相关论文
共 61 条
[1]  
[Anonymous], 2019, CIKM, DOI DOI 10.1145/3357384.3357955
[2]  
[Anonymous], 2004, P 13 INT C WORLD WID, DOI DOI 10.1145/988672.988687
[3]  
[Anonymous], 2020, ICML
[4]  
Bhakthavatsalam Sumithra, 2020, ARXIV200607510
[5]  
Bhakthavatsalam Sumithra, 2020, ARXIV200500660
[6]  
Brown TB, 2020, ADV NEUR IN, V33
[7]  
Cetto M., 2018, P 27 INT C COMP LING, P2300
[8]  
Chalier Y., 2020, AKBC
[9]  
Chen Tongfei, 2020, ACL
[10]  
Clark Peter, 2018, CORR