Multi-task deep learning model based on hierarchical relations of address elements for semantic address matching

被引:0
作者
Fangfang Li
Yiheng Lu
Xingliang Mao
Junwen Duan
Xiyao Liu
机构
[1] Central South University,School of Computer Science and Engineering
[2] Hunan University of Technology and Business,Institute of Big Data and Internet Innovation
来源
Neural Computing and Applications | 2022年 / 34卷
关键词
Address matching; Multi-task learning; Recognizes the address elements; Hierarchical relations between address elements;
D O I
暂无
中图分类号
学科分类号
摘要
Address matching, which aims to match unstructured addresses with standard addresses in an address database, is a key part of geocoding. The core problem of address matching corresponds to text matching in natural language processing. Existing rule-based methods require human-designed templates and thus, have limited applicability. Machine learning and deep learning-based methods ignore the hierarchical relations between address elements, which easily misclassify semantically similar but geographically different locations. We note that the hierarchy of address elements can fill the semantic gap in address matching. Inspired by how humans discriminate addresses, we propose a multi-task learning approach. The approach jointly recognises the address elements and matches the addresses to incorporate the hierarchical relations between the address elements into the neural network. Simultaneously, we introduce a priori information on the hierarchical relationship of address elements through the conditional random field model. Experimental results on the benchmark datasets Shenzhen Address Database and Jiangsu-Hunan Address Dataset demonstrate the effectiveness of our approach. We achieved state-of-the-art F1 scores (i.e. the harmonic mean of precision and recall) of 99.0 and 94.2 on the two datasets, respectively.
引用
收藏
页码:8919 / 8931
页数:12
相关论文
共 84 条
[1]  
Drummond WJ(1995)Address matching: GIS technology for mapping human activity patterns J Am Plann Assoc 61 240-251
[2]  
Edwards SE(2014)Geocoding large population-level administrative datasets at highly resolved spatial scales Trans GIS 18 586-603
[3]  
Strauss B(2021)A survey on multi-task learning IEEE Trans Knowl Data Eng 5 65-16
[4]  
Miranda ML(2016)Using an optimized Chinese address matching method to develop a geocoding service: a case study of Shenzhen, China ISPRS Int J Geo-Inf 10 1-938
[5]  
Zhang Y(2018)Experience: enhancing address matching with geocoding and similarity measure selection J Data Inf Qual (JDIQ) 11 913-921
[6]  
Yang Q(2018)Learning to combine multiple string similarity metrics for effective toponym matching Int J Digit Earth 18 912-848
[7]  
Tian Q(2020)CNN-RNN based intelligent recommendation for online medical pre-diagnosis support IEEE/ACM Trans Comput Biol Bioinform 31 825-1136
[8]  
Ren F(2017)Sensing spatial distribution of urban land use by integrating points-of-interest and Google Word2Vec model Int J Geogr Inf Sci 34 1117-2053
[9]  
Hu T(2020)Fine-grained landuse characterization using ground-based pictures: a deep learning solution based on globally available data Int J Geogr Inf Sci 32 2037-734
[10]  
Liu J(2020)Intelligent traffic monitoring and traffic diagnosis analysis based on neural network algorithm Neural Comput Appl 34 708-348