Pair Hidden Markov Model for Named Entity Matching

被引:1
|
作者
Nabende, Peter [1 ]
Tiedemann, Jorg [1 ]
Nerbonne, John [1 ]
机构
[1] Univ Groningen, Dept Computat Linguist, Ctr Language & Cognit Groningen, NL-9700 AB Groningen, Netherlands
关键词
Named entity; Similarity Measurement; Hidden Markov Model; pair-Hidden Markov Model;
D O I
10.1007/978-90-481-3658-2_87
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper introduces a pair-Hidden Markov Model (pair-HMM) for the task of evaluating the similarity between bilingual named entities. The pair-HMM is adapted from Mackay and Kondrak [1] who used it on the task of cognate identification and was later adapted by Wieling et al. [5] for Dutch dialect comparison. When using the pair-HMM for evaluating named entities, we do not consider the phonetic representation step as is the case with most named-entity similarity measurement systems. We instead consider the original orthographic representation of the input data and introduce into the pair-HMM representation for diacritics or accents to accommodate for pronunciation variations in the input data. We have first adapted the pair-HMM on measuring the similarity between named entities from languages (French and English) that use the same writing system (the Roman alphabet) and languages (English and Russian) that use a different writing system. The results are encouraging as we propose to extend the pair-HMM to more application oriented named-entity recognition and generation tasks.
引用
收藏
页码:497 / 502
页数:6
相关论文
共 50 条
  • [41] Named entity translation matching and learning: With application for mining unseen translations
    Lam, Wai
    Chan, Shing-Kit
    Huang, Ruizhang
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2007, 25 (01)
  • [42] A hybrid model for Chinese named entity recognition
    Sun, Xiao
    Huang, Degen
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 232 - 237
  • [43] Improving Rule-Based Name Entity Recognition System by Using Hidden Markov Model
    Dong, Junlin
    Zhang, Chunlu
    2008 INTERNATIONAL WORKSHOP ON INFORMATION TECHNOLOGY AND SECURITY, 2008, : 285 - 288
  • [44] A Named Entity Based Approach to Model Recipes
    Diwan, Nirav
    Batra, Devansh
    Bagler, Ganesh
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW 2020), 2020, : 88 - 93
  • [45] Named Entity Recognition Model for Polish Books
    Sopyla, Krzysztof
    Drozda, Pawel
    Ropiak, Krzysztof
    Witkowska, Urszula
    Sieniewicz, Malgorzata
    Jankowski, Sebastian
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT I, ACIIDS 2024, 2024, 14795 : 147 - 158
  • [46] Cross-domain Named Entity Recognition via Graph Matching
    Zheng, Junhao
    Chen, Haibin
    Ma, Qianli
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 2670 - 2680
  • [47] Person Entity Recognition for the Indonesian Qur'an Translation with the Approach Hidden Markov Model-Viterbi
    Syachrul, R. M. M. A. K.
    Bijaksana, Moch Arif
    Huda, Arief Fatchul
    4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE (ICCSCI 2019) : ENABLING COLLABORATION TO ESCALATE IMPACT OF RESEARCH RESULTS FOR SOCIETY, 2019, 157 : 214 - 220
  • [48] A Residual BiLSTM Model for Named Entity Recognition
    Yang, Gang
    Xu, Hongzhe
    IEEE ACCESS, 2020, 8 : 227710 - 227718
  • [49] A Neural Model for Unsupervised Named Entity Classification
    St. Chifu, Emil
    Chifu, Viorica R.
    2008 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING CONTROL & AUTOMATION, VOLS 1 AND 2, 2008, : 1077 - 1082
  • [50] MAF: A General Matching and Alignment Framework for Multimodal Named Entity Recognition
    Xu, Bo
    Huang, Shizhou
    Sha, Chaofeng
    Wang, Hongya
    WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 1215 - 1223