Unsupervised Descriptive Text Mining for Knowledge Graph Learning

被引:8
作者
Frisoni, Giacomo [1 ]
Moro, Gianluca [1 ]
Carbonaro, Antonella [1 ]
机构
[1] Univ Bologna, Dept Comp Sci & Engn DISI, Via Univ 50, I-47522 Cesena, Italy
来源
PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KDIR), VOL 1 | 2020年
关键词
Text Mining; Knowledge Graphs; Unsupervised Learning; Semantic Web; Ontology Learning; Rare Diseases; ACQUISITION; WEB;
D O I
10.5220/0010153603160324
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The use of knowledge graphs (KGs) in advanced applications is constantly growing, as a consequence of their ability to model large collections of semantically interconnected data. The extraction of relational facts from plain text is currently one of the main approaches for the construction and expansion of KGs. In this paper, we introduce a novel unsupervised and automatic technique of KG learning from corpora of short unstructured and unlabeled texts. Our approach is unique in that it starts from raw textual data and comes to: i) identify a set of relevant domain-dependent terms; ii) extract aggregate and statistically significant semantic relationships between terms, documents and classes; iii) represent the accurate probabilistic knowledge as a KG; iv) extend and integrate the KG according to the Linked Open Data vision. The proposed solution is easily transferable to many domains and languages as long as the data are available. As a case study, we demonstrate how it is possible to automatically learn a KG representing the knowledge contained within the conversational messages shared on social networks such as Facebook by patients with rare diseases, and the impact this can have on creating resources aimed to capture the "voice of patients".
引用
收藏
页码:316 / 324
页数:9
相关论文
共 36 条
  • [1] Allen Carl, 2019, ARXIV PREPRINT ARXIV
  • [2] [Anonymous], 2005, ONTOLOGY LEARNING TE
  • [3] A survey of ontology learning techniques and applications
    Asim, Muhammad Nabeel
    Wasim, Muhammad
    Khan, Muhammad Usman Ghani
    Mahmood, Waqar
    Abbasi, Hafiza Mahnoor
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2018,
  • [4] DBpedia - A crystallization point for the Web of Data
    Bizer, Christian
    Lehmann, Jens
    Kobilarov, Georgi
    Auer, Soeren
    Becker, Christian
    Cyganiak, Richard
    Hellmann, Sebastian
    [J]. JOURNAL OF WEB SEMANTICS, 2009, 7 (03): : 154 - 165
  • [5] Bollacker K, 2008, P 2008 ACM SIGMOD IN, P1247, DOI DOI 10.1145/1376616.1376746
  • [6] Bosselut A, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P4762
  • [7] Carbonaro A., 2018, J E-LEARN KNOWL SOC
  • [8] Carbonaro A, 2012, J E-LEARN KNOWL SOC, V8, P33
  • [9] Domeniconi Giacomo, 2014, 6th International Conference on Knowledge Discovery and Information Retrieval (KDIR 2014). Proceedings, P107
  • [10] Domeniconi Giacomo, 2014, 6th International Conference on Knowledge Discovery and Information Retrieval (KDIR 2014). Proceedings, P31