Pair Hidden Markov Model for Named Entity Matching

被引:1
|
作者
Nabende, Peter [1 ]
Tiedemann, Jorg [1 ]
Nerbonne, John [1 ]
机构
[1] Univ Groningen, Dept Computat Linguist, Ctr Language & Cognit Groningen, NL-9700 AB Groningen, Netherlands
关键词
Named entity; Similarity Measurement; Hidden Markov Model; pair-Hidden Markov Model;
D O I
10.1007/978-90-481-3658-2_87
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper introduces a pair-Hidden Markov Model (pair-HMM) for the task of evaluating the similarity between bilingual named entities. The pair-HMM is adapted from Mackay and Kondrak [1] who used it on the task of cognate identification and was later adapted by Wieling et al. [5] for Dutch dialect comparison. When using the pair-HMM for evaluating named entities, we do not consider the phonetic representation step as is the case with most named-entity similarity measurement systems. We instead consider the original orthographic representation of the input data and introduce into the pair-HMM representation for diacritics or accents to accommodate for pronunciation variations in the input data. We have first adapted the pair-HMM on measuring the similarity between named entities from languages (French and English) that use the same writing system (the Roman alphabet) and languages (English and Russian) that use a different writing system. The results are encouraging as we propose to extend the pair-HMM to more application oriented named-entity recognition and generation tasks.
引用
收藏
页码:497 / 502
页数:6
相关论文
共 50 条
  • [31] Pair hidden Markov models on tree structures
    Sakakibara, Yasubumi
    BIOINFORMATICS, 2003, 19 : i232 - i240
  • [32] Efficient implementation of a generalized pair hidden Markov model for comparative gene finding
    Majoros, WH
    Pertea, M
    Salzberg, SL
    BIOINFORMATICS, 2005, 21 (09) : 1782 - 1788
  • [33] Character Gazetteer for Named Entity Recognition with Linear Matching Complexity
    Dlugolinsky, Stefan
    Nguyen, Giang
    Laclavik, Michal
    Seleng, Martin
    2013 THIRD WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES (WICT), 2013, : 361 - 365
  • [34] Improving Disease Named Entity Recognition for Clinical Trial Matching
    Khan, Md Abdullah Al Hafiz
    Shamsuzzaman, Md
    Hasan, Sadid A.
    Sorower, Mohammad S.
    Liu, Joey
    Datla, Vivek
    Milosevic, Mladen
    Mankovich, Gabe
    van Ommering, Rob
    Dimitrova, Nevenka
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 2541 - 2548
  • [35] Fast Hidden Markov Model Map-Matching for Sparse and Noisy Trajectories
    Koller, Hannes
    Widhalm, Peter
    Dragaschnig, Melitta
    Graser, Anita
    2015 IEEE 18TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, : 2557 - 2561
  • [36] Map-matching approach based on link factor and hidden Markov model
    Zhang, Haiyan
    Luo, Yonglong
    Yu, Qingying
    Zheng, Xiaoyao
    Li, Xuejing
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (03) : 5455 - 5471
  • [37] Named Entity-based Question-Answering Pair Generator
    Lahiri, Aritra Kumar
    Hu, Qinmin Vivian
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4902 - 4906
  • [38] Markov Financial Model Using Hidden Markov Model
    Luc Tri Tuyen
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS & STATISTICS, 2013, 40 (10): : 72 - 83
  • [39] MEASURING PAIRWISE SIMILARITY OF TANDEM MASS SPECTRA USING PAIR HIDDEN MARKOV MODEL
    Ryu, So Young
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2018, : 204 - 209
  • [40] Parameter estimation in pair-hidden Markov models
    Arribas-Gil, Ana
    Gassiat, Elisabeth
    Matias, Catherine
    SCANDINAVIAN JOURNAL OF STATISTICS, 2006, 33 (04) : 651 - 671