Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer

Citations: 0
Authors
Wang, Peng [2 ,3 ]
Yang, Yifan [1 ]
Bang, Zheng [1 ]
Tan, Tian [1 ]
Zhang, Shiliang [4 ]
Chen, Xie [1 ]
Affiliations
[1] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China
[2] Chinese Acad Sci, Key Lab Speech Acoust & Content Understanding, Inst Acoust, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Alibaba Grp, Hangzhou, Peoples R China
Source
INTERSPEECH 2024 | 2024
Funding
National Natural Science Foundation of China;
Keywords
named entity recognition; factorized neural Transducer; class-based language model; beam search; SPEECH RECOGNITION; ASR;
DOI
10.21437/Interspeech.2024-653
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Despite advancements of end-to-end (E2E) models in speech recognition, named entity recognition (NER) is still challenging but critical for semantic understanding. Previous studies mainly focus on various rule-based or attention-based contextual biasing algorithms. However, their performance might be sensitive to the biasing weight or degraded by excessive attention to the named entity list, along with a risk of false triggering. Inspired by the success of the class-based language model (LM) in NER in conventional hybrid systems and the effective decoupling of acoustic and linguistic information in the factorized neural Transducer (FNT), we propose C-FNT, a novel E2E model that incorporates class-based LMs into FNT. In C-FNT, the LM score of named entities can be associated with the name class instead of its surface form. The experimental results show that our proposed C-FNT significantly reduces error in named entities without hurting performance in general word recognition.
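The idea of tying an entity's LM score to its class rather than its surface form can be illustrated with the textbook class-based factorization, P(w | h) = P(c | h) · P(w | c). The sketch below is not the C-FNT implementation, which this record does not describe; the probability table, the `<NAME>` tag, the entity list, and the function name are all hypothetical, chosen only to show the arithmetic.

```python
import math

# Hypothetical toy numbers, for illustration only.
# P(token | history) from a word-level LM whose vocabulary contains the class tag <NAME>.
WORD_LM = {
    ("call",): {"<NAME>": 0.30, "me": 0.20, "back": 0.10},
}

# Assumed entity list; P(surface form | <NAME>) is taken to be uniform over it.
NAME_LIST = ["Alice", "Bob", "Carol"]


def class_lm_log_prob(history, token):
    """Return log P(token | history) under a class-based factorization."""
    dist = WORD_LM[tuple(history)]
    if token in NAME_LIST:
        # log P(<NAME> | history) + log P(token | <NAME>)
        return math.log(dist["<NAME>"]) + math.log(1.0 / len(NAME_LIST))
    # Ordinary words are scored directly by the word-level LM.
    return math.log(dist[token])


# Every listed name gets the same score after "call", regardless of
# how often that particular name appeared in the LM training text.
score_alice = class_lm_log_prob(["call"], "Alice")
score_bob = class_lm_log_prob(["call"], "Bob")
```

Under this assumption, adding a new name only changes the entity list and the within-class distribution; the word-level LM is left untouched.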
Pages: 742-746
Page count: 5