Data integration for earthquake disaster using real-world data

被引:0
作者
Tian, Chuanzhao [1 ,2 ]
Li, Guoqing [1 ]
机构
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, Beijing 100094, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
Data integration; Earthquake disaster; Numeric data; Entity resolution; ENTITY RESOLUTION; RECORD LINKAGE;
D O I
10.1007/s11600-019-00381-4
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The purpose of entity resolution (ER) is to identify records that refer to the same real-world entity from different sources. Most traditional ER studies identify records based on string-based data, so the ER problem relies mostly on string comparison techniques. There is little research on numeric-based data. Traditional ER approaches are widely used in many domains, such as papers, gene sequencing and restaurants, but they have not been used in an earthquake disaster. In this paper, earthquake disaster event information that was collected from different websites is denoted with numeric data. To solve the problem of ER in numeric data, we use the following methods to conduct experiments. First, we treat numbers as strings and use string-based approaches. Second, we use the Euclidean distance to measure the difference between two records. Third, we combine the above two strategies and use a comprehensive approach to measure the distance between the two records. We experimentally evaluate our methods on real datasets that represent earthquake disaster event information. The experimental results show that a comprehensive approach can achieve high performance.
引用
收藏
页码:19 / 28
页数:10
相关论文
共 50 条
[21]   Data integration using service composition in data service middleware [J].
Gannouni, Sofien ;
Beraka, Mutaz ;
Mathkour, Hassan .
SECURITY AND COMMUNICATION NETWORKS, 2014, 7 (11) :2134-2144
[22]   Data integration to prioritize drugs using genomics and curated data [J].
Riku Louhimo ;
Marko Laakso ;
Denis Belitskin ;
Juha Klefström ;
Rainer Lehtonen ;
Sampsa Hautaniemi .
BioData Mining, 9
[23]   Dynamic integration of biological data sources using the data concierge [J].
Gong P. .
Health Information Science and Systems, 1 (1)
[24]   Process and Future of Data Integration within the European Earthquake Engineering Laboratories [J].
Martinez, Ignacio Lamata ;
Ioannidis, Ioannis ;
Pegon, Pierre ;
Williams, Martin S. ;
Blakeborough, Anthony .
JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2014, 28 (03)
[25]   Data Integration using Machine Learning [J].
Birgersson, Marcus ;
Hansson, Gustav ;
Franke, Ulrik .
2016 IEEE 20TH INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING WORKSHOP (EDOCW), 2016, :313-322
[26]   Data Integration in ETL Using TALEND [J].
Sreemathy, J. ;
Joseph, Infant, V ;
Nisha, S. ;
Prabha, Chaaru, I ;
Priya, Gokula R. M. .
2020 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2020, :1444-1448
[27]   Incremental entity resolution process over query results for data integration systems [J].
Machado Vieira, Priscilla Kelly ;
Loscio, Bernadette Farias ;
Salgado, Ana Carolina .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2019, 52 (02) :451-471
[28]   Incremental entity resolution process over query results for data integration systems [J].
Priscilla Kelly Machado Vieira ;
Bernadette Farias Lóscio ;
Ana Carolina Salgado .
Journal of Intelligent Information Systems, 2019, 52 :451-471
[29]   Research on Integration of Spatial-Data and Business-Data in Disaster Emergency Management System based GIS [J].
Hu Feihu ;
Chen Huimin ;
Chen Ting ;
Zhang Zhi .
ICPOM2008: PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE OF PRODUCTION AND OPERATION MANAGEMENT, VOLUMES 1-3, 2008, :654-657
[30]   Integration of graphs from different data sources using crowdsourcing [J].
Kim, Younghoon ;
Jung, Woohwan ;
Shim, Kyuseok .
INFORMATION SCIENCES, 2017, 385 :438-456