Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems

被引:0
作者
Mao, Tingzhi [1 ]
Khassanov, Yerbolat [2 ,3 ]
Pham, Van Tung [2 ]
Xu, Haihua [2 ]
Huang, Hao [1 ]
Chng, Eng Siong [2 ]
机构
[1] Xinjiang Univ, Sch Informat Sci & Engn, Urumqi, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[3] Nazarbayev Univ, ISSAI, Baku, Azerbaijan
来源
2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2021年
基金
国家重点研发计划;
关键词
speech recognition; named entity recognition; graphemic lexicon; word lattice; word embeddings;
D O I
10.1109/ISCSLP49672.2021.9362062
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a series of complementary approaches to improve the recognition of underrepresented named entities (NE) in hybrid ASR systems without compromising overall word error rate performance. The underrepresented words correspond to rare or out-of-vocabulary (OOV) words in the training data, and thereby can't be modeled reliably. We begin with graphemic lexicon which allows to drop the necessity of phonetic models in hybrid ASR. We study it under different settings and demonstrate its effectiveness in dealing with underrepresented NEs. Next, we study the impact of neural language model (LM) with letter-based features derived to handle infrequent words. After that, we attempt to enrich representations of underrepresented NEs in pretrained neural LM by borrowing the embedding representations of rich-represented words. This let us gain significant performance improvement on underrepresented NE recognition. Finally, we boost the likelihood scores of utterances containing NEs in the word lattices rescored by neural LMs and gain further performance improvement. The combination of the aforementioned approaches improves NE recognition by up to 42% relatively.
引用
收藏
页数:5
相关论文
共 50 条
[41]   Improving unified named entity recognition by incorporating mention relevance [J].
Ji, Lijun ;
Yan, Danfeng ;
Cheng, Zhuoran ;
Song, Yan .
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (30) :22223-22234
[42]   Improving unified named entity recognition by incorporating mention relevance [J].
Lijun Ji ;
Danfeng Yan ;
Zhuoran Cheng ;
Yan Song .
Neural Computing and Applications, 2023, 35 :22223-22234
[43]   RWTH ASR Systems for LibriSpeech: Hybrid vs Attention [J].
Luescher, Christoph ;
Beck, Eugen ;
Irie, Kazuki ;
Kitza, Markus ;
Michel, Wilfried ;
Zeyer, Albert ;
Schlueter, Ralf ;
Ney, Hermann .
INTERSPEECH 2019, 2019, :231-235
[44]   Named entity recognition using hybrid machine learning approach [J].
Chiong, Raymond ;
Wei, Wang .
PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, :578-583
[45]   Resource-Size matters: Improving Neural Named Entity Recognition with Optimized Large Corpora [J].
Ahmed, Sajawel ;
Mehler, Alexander .
2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, :919-924
[46]   Hybrid Feature Selection Approach for Arabic Named Entity Recognition [J].
Shahine, Miran ;
Sakre, Mohamed .
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT I, 2018, 9623 :452-464
[47]   Number Entities Recognition in Multiple Rounds of Dialogue Systems [J].
Zhang, Shan ;
Cao, Bin ;
Xu, Yueshen ;
Fan, Jing .
CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2021, 127 (01) :309-323
[48]   Automated Testing and Improvement of Named Entity Recognition Systems [J].
Yu, Boxi ;
Hu, Yiyan ;
Mang, Qiuyang ;
Hu, Wenhan ;
He, Pinjia .
PROCEEDINGS OF THE 31ST ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2023, 2023, :883-894
[49]   Social Network Science Approaches for Disease Named Entity Recognition and Extraction [J].
Joshi, Sarvesh ;
Kamath, Sowmya S. .
38TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN 2024, 2024, :96-101
[50]   Improving Few-Shot Named Entity Recognition with Causal Interventions [J].
Yang, Zhen ;
Liu, Yongbin ;
Ouyang, Chunping ;
Zhao, Shu ;
Zhu, Chi .
BIG DATA MINING AND ANALYTICS, 2024, 7 (04) :1421-1421