Data integration for earthquake disaster using real-world data

被引:0
作者
Tian, Chuanzhao [1 ,2 ]
Li, Guoqing [1 ]
机构
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, Beijing 100094, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
Data integration; Earthquake disaster; Numeric data; Entity resolution; ENTITY RESOLUTION; RECORD LINKAGE;
D O I
10.1007/s11600-019-00381-4
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The purpose of entity resolution (ER) is to identify records that refer to the same real-world entity from different sources. Most traditional ER studies identify records based on string-based data, so the ER problem relies mostly on string comparison techniques. There is little research on numeric-based data. Traditional ER approaches are widely used in many domains, such as papers, gene sequencing and restaurants, but they have not been used in an earthquake disaster. In this paper, earthquake disaster event information that was collected from different websites is denoted with numeric data. To solve the problem of ER in numeric data, we use the following methods to conduct experiments. First, we treat numbers as strings and use string-based approaches. Second, we use the Euclidean distance to measure the difference between two records. Third, we combine the above two strategies and use a comprehensive approach to measure the distance between the two records. We experimentally evaluate our methods on real datasets that represent earthquake disaster event information. The experimental results show that a comprehensive approach can achieve high performance.
引用
收藏
页码:19 / 28
页数:10
相关论文
共 50 条
[31]   Anchoring the Consistency Dimension of Data Quality Using Ontology in Data Integration [J].
Chen Wei-Liang ;
Zhang Shi-Dong ;
Gao Xiang .
2009 SIXTH WEB INFORMATION SYSTEMS AND APPLICATIONS CONFERENCE, PROCEEDINGS, 2009, :201-+
[32]   Data Management Platform for Simplifying Application Integration by Using Context of Data [J].
Moribe H. ;
Koizumi M. ;
Igarashi Y. ;
Tsuno S. ;
Tajima Y. .
IEEJ Transactions on Electronics, Information and Systems, 2021, 141 (12) :1462-1471
[33]   Data integration in data warehousing [J].
Calvanese, D ;
De Giacomo, G ;
Lenzerini, M ;
Nardi, D ;
Rosati, R .
INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2001, 10 (03) :237-271
[34]   A Data Model for Data Integration [J].
Lu, James J. .
ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2006, 150 (02) :3-19
[35]   Diagnostic Data Integration Using Deep Neural Networks for Real-Time Plasma Analysis [J].
Garola, A. Rigoni ;
Cavazzana, R. ;
Gobbin, M. ;
Delogu, R. S. ;
Manduchi, G. ;
Taliercio, C. ;
Luchetta, A. .
IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 2021, 68 (08) :2165-2172
[36]   Populating Entity Name Systems for Big Data Integration [J].
Kejriwal, Mayank .
SEMANTIC WEB - ISWC 2014, PT II, 2014, 8797 :521-528
[37]   A Real-World Application of Process Mining for Data-Driven Analysis of Multi-Level Interlinked Manufacturing Processes [J].
Birk, Alexander ;
Wilhelm, Yannick ;
Dreher, Simon ;
Flack, Christian ;
Reimann, Peter ;
Groeger, Christoph .
54TH CIRP CONFERENCE ON MANUFACTURING SYSTEMS 2021-TOWARDS DIGITALIZED MANUFACTURING 4.0, CMS 2021, 2021, 104 :417-422
[38]   Using an Inference Mechanism for Helping the Data Integration [J].
Pequeno, Valeria Magalhaes ;
Moura Pires, Joao Carlos .
PROCEEDINGS OF THE 15TH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM (IDEAS '11), 2011, :251-253
[39]   Using Signifiers for Data Integration in Rail Automation [J].
Wurl, Alexander ;
Falkner, Andreas ;
Haselbock, Alois ;
Mazak, Alexandra .
PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2017, :172-179
[40]   OSS Data Integration using Virtual Database [J].
Take, Naoki ;
Nishio, Manabu ;
Seshake, Hikaru .
2013 15TH ASIA-PACIFIC NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (APNOMS), 2013,