Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer

被引:0
作者
Wang, Peng [2 ,3 ]
Yang, Yifan [1 ]
Bang, Zheng [1 ]
Tan, Tian [1 ]
Zhang, Shiliang [4 ]
Chen, Xie [1 ]
机构
[1] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China
[2] Chinese Acad Sci, Key Lab Speech Acoust & Content Understanding, Inst Acoust, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Alibaba Grp, Hangzhou, Peoples R China
来源
INTERSPEECH 2024 | 2024年
基金
中国国家自然科学基金;
关键词
named entity recognition; factorized neural Transducer; class-based language model; beam search; SPEECH RECOGNITION; ASR;
D O I
10.21437/Interspeech.2024-653
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite advancements of end-to-end (E2E) models in speech recognition, named entity recognition (NER) is still challenging but critical for semantic understanding. Previous studies mainly focus on various rule-based or attention-based contextual biasing algorithms. However, their performance might be sensitive to the biasing weight or degraded by excessive attention to the named entity list, along with a risk of false triggering. Inspired by the success of the class-based language model (LM) in NER in conventional hybrid systems and the effective decoupling of acoustic and linguistic information in the factorized neural Transducer (FNT), we propose C-FNT, a novel E2E model that incorporates class-based LMs into FNT. In C-FNT, the LM score of named entities can be associated with the name class instead of its surface form. The experimental results show that our proposed C-FNT significantly reduces error in named entities without hurting performance in general word recognition.
引用
收藏
页码:742 / 746
页数:5
相关论文
共 50 条
  • [41] A Comparative Study of Named Entity Recognition on Myanmar Language
    Nandar, Tin Latt
    Soe, Thinn Lai
    Soe, Khin Mar
    PROCEEDINGS OF 2020 23RD CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (ORIENTAL-COCOSDA 2020), 2020, : 60 - 64
  • [42] Class incremental named entity recognition without forgetting
    Liu, Ye
    Huang, Shaobin
    Wei, Chi
    Tian, Sicheng
    Li, Rongsheng
    Yan, Naiyu
    Du, Zhijuan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, 67 (01) : 301 - 324
  • [43] HistNERo: Historical Named Entity Recognition for the Romanian Language
    Avram, Andrei-Marius
    Iuga, Andreea
    Manolache, George-Vlad
    Matei, Vlad-Cristian
    Miclius, Razvan-Gabriel
    Muntean, Vlad-Andrei
    Sorlescu, Manuel-Petru
    Serban, Dragon-Andrei
    Urse, Adrian-Dinu
    Pais, Vasile
    Cerce, Dumitru-Clementin
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT III, 2024, 14806 : 126 - 144
  • [44] Terminologies augmented recurrent neural network model for clinical named entity recognition
    Lerner, Ivan
    Paris, Nicolas
    Tannier, Xavier
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 102
  • [45] A neural model for text localization, transcription and named entity recognition in full pages
    Carbonell, Manuel
    Fornes, Alicia
    Villegas, Mauricio
    Llados, Josep
    PATTERN RECOGNITION LETTERS, 2020, 136 : 219 - 227
  • [46] Recurrent neural networks for Turkish named entity recognition
    Gungor, Onur
    Uskudarli, Suzan
    Gungor, Tunga
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [47] Attentive Neural Network for Named Entity Recognition in Vietnamese
    Kim Anh Nguyen
    Ngan Dong
    Cam-Tu Nguyen
    2019 IEEE - RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF), 2019, : 266 - 271
  • [48] Neural Networks for Featureless Named Entity Recognition in Czech
    Strakova, Jana
    Straka, Milan
    Hajic, Jan
    TEXT, SPEECH, AND DIALOGUE, 2016, 9924 : 173 - 181
  • [49] Advanced Long-Content Speech Recognition With Factorized Neural Transducer
    Gong, Xun
    Wu, Yu
    Li, Jinyu
    Liu, Shujie
    Zhao, Rui
    Chen, Xie
    Qian, Yanmin
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1803 - 1815
  • [50] A hybrid model for Chinese named entity recognition
    Sun, Xiao
    Huang, Degen
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 232 - 237