Error-driven HMM-based chunk tagger with context-dependent lexicon

被引:0
|
作者
Zhou, GD [1 ]
Su, R [1 ]
机构
[1] Kent Ridge Digital Labs, Singapore 119613, Singapore
来源
PROCEEDINGS OF THE 2000 JOINT SIGDAT CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND VERY LARGE CORPORA | 2000年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new error-driven HMM-based text chunk tagger with context-dependent lexicon. Compared with standard HMM-based tagger, this tagger uses a new Hidden Markov Modelling approach which incorporates more contextual information into a lexical entry. Moreover, an error-driven learning approach is adopted to decrease the memory requirement by keeping only positive lexical entries and makes it possible to further incorporate more context-dependent lexical entries. Experiments show that this technique achieves overall precision and recall rates of 93.40% and 93.95% for all chunk types, 93.60% and 94.64% for noun phrases, and 94.64% and 94.75% for verb phrases when trained on PENN WSJ TreeBank section 00-19 and tested on section 20-24, while 25-fold validation experiments of PENN WSJ TreeBank show overall precision and recall rates of 96.40% and 96.47% for all chunk types, 96.49% and 96.99% for noun phrases, and 97.13% and 97.36% for verb phrases.
引用
收藏
页码:71 / 79
页数:9
相关论文
共 50 条
  • [1] Named entity recognition using an HMM-based chunk tagger
    Zhou, GD
    Su, J
    40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2002, : 473 - 480
  • [2] Context-Dependent Labels for an HMM-Based Speech Synthesis System for Malay
    Mustafa, Mumtaz B.
    Don, Zuraidah M.
    Knowles, Gerry
    2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [3] Integration of context-dependent durational knowledge into HMM-based speech recognition
    Wang, X
    tenBosch, LFM
    Pols, LCW
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1073 - 1076
  • [4] On the use of context-dependent modeling units for HMM-based offline handwriting recognition
    Fink, Gernot A.
    Ploetz, Thomas
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 729 - 733
  • [5] Context-dependent substroke model for HMM-based on-line handwriting recognition
    Tokuno, J
    Inami, N
    Matsuda, S
    Nakai, M
    Shimodaira, H
    Sagayama, S
    EIGHTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION: PROCEEDINGS, 2002, : 78 - 83
  • [6] Context-dependent substroke model for HMM-based on-line handwriting recognition
    Graduate School of Information Science, Japan Advanced Institute of Science and Technology, Japan
    不详
    Proc. Int. Workshop Front. Handwriting Recogn. IWFHR, (78-83):
  • [7] Context-dependent additive log F0 model for HMM-based speech synthesis
    Zen, Heiga
    Braunschweiler, Norbert
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2039 - 2042
  • [8] Enhancing HMM-based POS tagger for Mizo language
    Nunsanga, Morrel V. L.
    Pakray, Partha
    Devi, Toijam Sonalika
    Singh, L. Lolit Kr
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 11725 - 11736
  • [9] Using Synthetic Clinical Data to Train an HMM-Based POS Tagger
    Knoll, Benjamin C.
    Melton, Genevieve B.
    Liu, Hongfang
    Xu, Hua
    Pakhomov, Serguei V. S.
    2016 3RD IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS, 2016, : 252 - 255
  • [10] HMM-Based Lexicon-Driven and Lexicon-Free Word Recognition for Online Handwritten Indic Scripts
    Bharath, A.
    Madhvanath, Sriganesh
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) : 670 - 682