Geospatial data conflation: a formal approach based on optimization and relational databases

Cited by: 10
Author
Lei, Ting L. [1]
Affiliation
[1] Univ Kansas, Dept Geog & Atmospher Sci, Lawrence, KS 66045 USA
Funding
National Natural Science Foundation of China
Keywords
Data fusion; conflation; optimization; geographic information systems; relational database; matching method
DOI
10.1080/13658816.2020.1778001
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Geospatial data conflation is aimed at matching counterpart features from two or more data sources in order to combine and better utilize information in the data. Due to the importance of conflation in spatial analysis, different approaches to the conflation problem have been proposed, ranging from simple buffer-based methods to probability- and optimization-based models. In this paper, I propose a formal framework for conflation that integrates two powerful tools of geospatial computation: optimization and relational databases. I discuss the connection between relational database theory and conflation, and demonstrate how the conflation process can be formulated and carried out in standard relational databases. I also propose a set of new optimization models that can be used inside relational databases to solve the conflation problem. The optimization models are based on the minimum cost circulation problem in operations research (also known as the network flow problem), which generalizes existing optimal conflation models that are primarily based on the assignment problem. Using comparable datasets, computational experiments show that the proposed conflation method is effective and outperforms existing optimal conflation models by a large margin. Given its generality, the new method may be applicable to other data types and conflation problems.
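To make the assignment-problem baseline mentioned in the abstract concrete, here is a minimal Python sketch. It is not the paper's model or code: the point coordinates, the table names `a` and `b`, and the helpers `candidate_costs` and `best_assignment` are all hypothetical, and the brute-force search stands in for a real optimization solver. It only illustrates the general idea of generating candidate pairs with a relational join and then choosing a one-to-one matching of minimum total distance.

```python
import itertools
import math
import sqlite3

# Hypothetical toy data: (id, x, y) points from two sources describing the same features.
SOURCE_A = [(1, 0.0, 0.0), (2, 5.0, 0.0), (3, 0.0, 5.0)]
SOURCE_B = [(10, 0.3, 0.1), (20, 0.2, 4.8), (30, 5.1, -0.2)]


def candidate_costs(conn):
    """Use a relational cross join to compute the distance of every (a, b) pair."""
    rows = conn.execute(
        "SELECT a.id, b.id, a.x, a.y, b.x, b.y FROM a CROSS JOIN b"
    ).fetchall()
    return {(ai, bi): math.hypot(ax - bx, ay - by) for ai, bi, ax, ay, bx, by in rows}


def best_assignment(a_ids, b_ids, cost):
    """Brute-force the one-to-one assignment with minimum total distance.

    Assumes len(a_ids) <= len(b_ids) and is only sensible for tiny inputs;
    a dedicated solver would be used on real data.
    """
    best_total, best_pairs = math.inf, None
    for perm in itertools.permutations(b_ids, len(a_ids)):
        total = sum(cost[(a, b)] for a, b in zip(a_ids, perm))
        if total < best_total:
            best_total, best_pairs = total, list(zip(a_ids, perm))
    return best_pairs, best_total


# Load both sources into an in-memory relational database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE a (id INTEGER PRIMARY KEY, x REAL, y REAL)")
conn.execute("CREATE TABLE b (id INTEGER PRIMARY KEY, x REAL, y REAL)")
conn.executemany("INSERT INTO a VALUES (?, ?, ?)", SOURCE_A)
conn.executemany("INSERT INTO b VALUES (?, ?, ?)", SOURCE_B)

cost = candidate_costs(conn)
pairs, total = best_assignment(
    [row[0] for row in SOURCE_A], [row[0] for row in SOURCE_B], cost
)
print(pairs)  # [(1, 10), (2, 30), (3, 20)]
```

The assignment problem is a special case of minimum cost network flow, which is the sense in which the abstract describes the proposed models as a generalization; this sketch does not attempt to reproduce those models.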
Pages: 2296-2334
Page count: 39