Pair Hidden Markov Model for Named Entity Matching

被引:1
|
作者
Nabende, Peter [1 ]
Tiedemann, Jorg [1 ]
Nerbonne, John [1 ]
机构
[1] Univ Groningen, Dept Computat Linguist, Ctr Language & Cognit Groningen, NL-9700 AB Groningen, Netherlands
来源
INNOVATIONS AND ADVANCES IN COMPUTER SCIENCES AND ENGINEERING | 2010年
关键词
Named entity; Similarity Measurement; Hidden Markov Model; pair-Hidden Markov Model;
D O I
10.1007/978-90-481-3658-2_87
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper introduces a pair-Hidden Markov Model (pair-HMM) for the task of evaluating the similarity between bilingual named entities. The pair-HMM is adapted from Mackay and Kondrak [1] who used it on the task of cognate identification and was later adapted by Wieling et al. [5] for Dutch dialect comparison. When using the pair-HMM for evaluating named entities, we do not consider the phonetic representation step as is the case with most named-entity similarity measurement systems. We instead consider the original orthographic representation of the input data and introduce into the pair-HMM representation for diacritics or accents to accommodate for pronunciation variations in the input data. We have first adapted the pair-HMM on measuring the similarity between named entities from languages (French and English) that use the same writing system (the Roman alphabet) and languages (English and Russian) that use a different writing system. The results are encouraging as we propose to extend the pair-HMM to more application oriented named-entity recognition and generation tasks.
引用
收藏
页码:497 / 502
页数:6
相关论文
共 50 条
  • [41] Financial Sequences and the Hidden Markov Model
    Sengupta, Shreeya
    Wang, Hui
    Blackburn, William
    Ojha, Piyush
    GLOBAL TRENDS IN INFORMATION SYSTEMS AND SOFTWARE APPLICATIONS, PT 2, 2012, 270 : 5 - 12
  • [42] Integrating various features in hidden Markov model using constraint relaxation algorithm for recognition of named entities without gazetteers
    Zhou, GD
    Su, J
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 465 - 470
  • [43] Estimating the order of a hidden Markov model
    MacKay, RJ
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2002, 30 (04): : 573 - 589
  • [44] Hidden Markov model with missing emissions
    Karima Elkimakh
    Abdelaziz Nasroallah
    Computational Statistics, 2024, 39 : 385 - 403
  • [45] A navigation path and high-definition map matching scheme based on improved hidden Markov model
    Liu, Haiyan
    Wang, Kunfeng
    Wang, Yadong
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 51 - 55
  • [46] A Hidden Markov Model-Based Map Matching Algorithm for Low Sampling Rate Trajectory Data
    Hu, Yigong
    Lu, Binbin
    IEEE ACCESS, 2019, 7 : 178235 - 178245
  • [47] Splice sites detection by combining Markov and hidden Markov model
    Zhang, Quanwei
    Peng, Qinke
    Li, Kankan
    Kang, Xuejiao
    Li, Jing
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 1531 - +
  • [48] A coarse-grained Markov chain is a hidden Markov model
    MacDonald, Iain L.
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2020, 541
  • [49] Map-Matching Using Hidden Markov Model and Path Choice Preferences under Sparse Trajectory
    Xiong, Zhengang
    Li, Bin
    Liu, Dongmei
    SUSTAINABILITY, 2021, 13 (22)
  • [50] Hidden Markov map matching based on trajectory segmentation with heading homogeneity
    Cui, Ge
    Bian, Wentao
    Wang, Xin
    GEOINFORMATICA, 2021, 25 (01) : 179 - 206