Geospatial data conflation: a formal approach based on optimization and relational databases

Cited by: 10
Author
Lei, Ting L. [1]
Affiliation
[1] Univ Kansas, Dept Geog & Atmospher Sci, Lawrence, KS 66045 USA
Funding
National Natural Science Foundation of China
Keywords
Data fusion; conflation; optimization; geographic information systems; relational database; matching method
DOI
10.1080/13658816.2020.1778001
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Geospatial data conflation aims to match counterpart features from two or more data sources in order to combine and better utilize the information in the data. Because conflation is important in spatial analysis, many approaches to the problem have been proposed, ranging from simple buffer-based methods to probability-based and optimization-based models. In this paper, I propose a formal framework for conflation that integrates two powerful tools of geospatial computation: optimization and relational databases. I discuss the connection between relational database theory and conflation, and demonstrate how the conflation process can be formulated and carried out in standard relational databases. I also propose a set of new optimization models that can be used inside relational databases to solve the conflation problem. The optimization models are based on the minimum cost circulation problem in operations research (also known as the network flow problem), which generalizes existing optimal conflation models that are primarily based on the assignment problem. Computational experiments on comparable datasets show that the proposed conflation method is effective and outperforms existing optimal conflation models by a large margin. Given its generality, the new method may be applicable to other data types and conflation problems.
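As a rough illustration of the assignment-problem baseline that the abstract says existing optimal conflation models build on, the sketch below matches point features from two datasets one-to-one so that total Euclidean distance is minimized. It is not the paper's minimum cost circulation model. The function name `assignment_match`, the sample coordinates, and the choice of plain Euclidean distance as the cost are assumptions made for illustration, and the brute-force search is only practical for a handful of features.

```python
from itertools import permutations
from math import hypot


def assignment_match(source, target):
    """Match each source point to a distinct target point at least total distance.

    Hypothetical helper for illustration: brute-force search over all
    one-to-one assignments. Assumes len(source) <= len(target).
    Returns (total_cost, [(source_index, target_index), ...]).
    """
    best_cost, best_pairs = float("inf"), []
    # Each permutation assigns source point i to target point perm[i].
    for perm in permutations(range(len(target)), len(source)):
        cost = sum(
            hypot(source[i][0] - target[j][0], source[i][1] - target[j][1])
            for i, j in enumerate(perm)
        )
        if cost < best_cost:
            best_cost, best_pairs = cost, list(enumerate(perm))
    return best_cost, best_pairs


# Made-up coordinates: two source features, three candidate target features.
source = [(0.0, 0.0), (10.0, 0.0)]
target = [(10.0, 1.0), (0.0, 1.0), (50.0, 50.0)]
cost, pairs = assignment_match(source, target)
print(cost, pairs)
```

In this toy case each source point is paired with the target point one unit away from it, and the distant third target is left unmatched. A strict one-to-one assignment cannot express one-to-many or many-to-many matches, which is the kind of limitation a more general network flow formulation can relax.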
Pages: 2296-2334
Page count: 39