Data integration for earthquake disaster using real-world data

被引:0
作者
Tian, Chuanzhao [1 ,2 ]
Li, Guoqing [1 ]
机构
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, Beijing 100094, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
Data integration; Earthquake disaster; Numeric data; Entity resolution; ENTITY RESOLUTION; RECORD LINKAGE;
D O I
10.1007/s11600-019-00381-4
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The purpose of entity resolution (ER) is to identify records that refer to the same real-world entity from different sources. Most traditional ER studies identify records based on string-based data, so the ER problem relies mostly on string comparison techniques. There is little research on numeric-based data. Traditional ER approaches are widely used in many domains, such as papers, gene sequencing and restaurants, but they have not been used in an earthquake disaster. In this paper, earthquake disaster event information that was collected from different websites is denoted with numeric data. To solve the problem of ER in numeric data, we use the following methods to conduct experiments. First, we treat numbers as strings and use string-based approaches. Second, we use the Euclidean distance to measure the difference between two records. Third, we combine the above two strategies and use a comprehensive approach to measure the distance between the two records. We experimentally evaluate our methods on real datasets that represent earthquake disaster event information. The experimental results show that a comprehensive approach can achieve high performance.
引用
收藏
页码:19 / 28
页数:10
相关论文
共 50 条
[41]   Official Statistics Data Integration Using Copulas [J].
Dalla Valle, Luciana .
QUALITY TECHNOLOGY AND QUANTITATIVE MANAGEMENT, 2014, 11 (01) :111-131
[42]   Semantic integration of relational data using SPARQL [J].
Wang, Jinpeng ;
Miao, Zhuang ;
Zhang, Yafei ;
Lu, Jianjiang .
2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL I, PROCEEDINGS, 2008, :422-426
[43]   A Design of Data Integration Using Cloud Computing [J].
Geng, Yushui ;
Kou, Jisong .
ADVANCES IN FUTURE COMPUTER AND CONTROL SYSTEMS, VOL 2, 2012, 160 :415-419
[44]   Improving Model and Data Integration Using MOSAIC as Central Data Management Platform [J].
Kraus, Robert ;
Fillinger, Sandra ;
Tolksdorf, Gregor ;
Minh, Duc H. ;
Merchan-Restrepo, Victor A. ;
Wozny, Guenter .
CHEMIE INGENIEUR TECHNIK, 2014, 86 (07) :1130-1136
[45]   Biomedical data integration: using XML to link clinical and research data sets [J].
Berman, JJ ;
Bhatia, K .
EXPERT REVIEW OF MOLECULAR DIAGNOSTICS, 2005, 5 (03) :329-336
[46]   Capturing Enterprise Data Integration Challenges Using a Semiotic Data Quality Framework [J].
John Krogstie .
Business & Information Systems Engineering, 2015, 57 :27-36
[47]   Capturing Enterprise Data Integration Challenges Using a Semiotic Data Quality Framework [J].
Krogstie, John .
BUSINESS & INFORMATION SYSTEMS ENGINEERING, 2015, 57 (01) :27-36
[48]   Data Integration GeoService: A First Proposed Approach Using Historical Geographic Data [J].
Grosso, Eric ;
Bouju, Alain ;
Mustiere, Sebastien .
WEB AND WIRELESS GEOGRAPHICAL INFORMATION SYSTEMS, PROCEEDINGS, 2009, 5886 :103-+
[49]   Real-Time Gameplay Data and Biometric Measurement Integration as a Data Source for Game User Research [J].
Balcerzak, Adam ;
Laczynski, Marcin ;
Hufschmitt, Aline ;
Gackowski, Tomasz .
DIGITAL INTERACTION AND MACHINE INTELLIGENCE, MIDI 2023, 2024, 1076 :144-156
[50]   Shop floor data integration Data integration layer for Manufacturing IT [J].
Goertz, Dennis ;
Hahnen, Frank ;
Hanisch, Felix ;
Hauer, Markus ;
Neugebauer, Torsten .
ATP MAGAZINE, 2022, (6-7) :90-96