Knowledge graph mining for realty domain using dependency parsing and QAT models

被引:1
作者
Zamiralov, Alexander [1 ]
Sohin, Timur [1 ]
Butakov, Nikolay [1 ]
机构
[1] ITMO Univ, 49 Kronverksky Pr, St Petersburg 197101, Russia
来源
10TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE (YSC2021) | 2021年 / 193卷
基金
俄罗斯科学基金会;
关键词
ontology; knowledge-graph; QAT; neural network; dependency parsing; real estates; ONTOLOGY;
D O I
10.1016/j.procs.2021.10.004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The real estate business has a lot of risks, and in order to minimize them, you need a lot of information from different sources. Systems based on natural language processing can help customers find this information more easily: question answering, information retrieval, etc. The existing method of question answering requires data aligned with possible questions, which are not easy to obtain, in contrast, the knowledge-graph provides structured information. In this paper, we propose semi-automated ontology generation for the realty domain and a subsequent method for information retrieval related to the knowledge-graph of this ontology. The first contribution is the method for relation extraction method based on dependency-parsing and semantic similarity evaluation, which allows us to form ontology for a particular domain. The second contribution is knowledge-graph completion method based on question answering over text neural network. Our experimental analysis shows the efficiency of the proposed approaches. (C) 2021 The Authors. Published by Elsevier B.V.
引用
收藏
页码:32 / 41
页数:10
相关论文
共 16 条
  • [1] Bonino D, 2008, LECT NOTES COMPUT SC, V5318, P790, DOI 10.1007/978-3-540-88564-1_51
  • [2] Unified domain-specific language for collecting and processing data of social media
    Butakov, Nikolay
    Petrov, Maxim
    Mukhina, Ksenia
    Nasonov, Denis
    Kovalchuk, Sergey
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2018, 51 (02) : 389 - 414
  • [3] Chaves M., 2012, HONTOLOGY MULTILINGU
  • [4] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [5] SpanBERT: Improving Pre-training by Representing and Predicting Spans
    Joshi, Mandar
    Chen, Danqi
    Liu, Yinhan
    Weld, Daniel S.
    Zettlemoyer, Luke
    Levy, Omer
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2020, 8 : 64 - 77
  • [6] Liu Y, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3730
  • [7] An ontology-based multicriteria spatial decision support system: a case study of house selection
    Malczewski, Jacek
    Jelokhani-Niaraki, Mohammadreza
    [J]. GEO-SPATIAL INFORMATION SCIENCE, 2012, 15 (03) : 177 - 185
  • [8] Montani Ines, 2023, Zenodo
  • [9] Rajpurkar P., 2016, ARXIV
  • [10] Reimers N, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3982