Speech Recognition Model Inspired on Large Language Model for Smart Grid Dispatching

被引：0

作者：

Na, Qionglan ^{[1
]}

Yang, Yixi ^{[2
]}

Su, Dan ^{[1
]}

Li, Xin ^{[1
]}

Wang, Yifei ^{[1
]}

Chen, Zhongtao ^{[1
]}

机构：

[1] State Grid Jibei Informat & Telecommun Co, Beijing 100053, Peoples R China

[2] State Grid Informat & Telecommun Branch, Beijing 100761, Peoples R China

来源：

PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON POWER ELECTRONICS AND ARTIFICIAL INTELLIGENCE, PEAI 2024 | 2024年

关键词：

Grid; Speech recognition; Smart grid; Deep learning; FUTURE; CELLS;

D O I：

10.1145/3674225.3674303

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, large language models have gained popularity across various domains, with particular attention given to the impressive performance of their core component, the Transformer. This paper aims to enhance the accuracy of intelligent power grid dispatch speech recognition by leveraging deep learning techniques, specifically CNN and Transformer architectures. The proposed approach involves the creation of a specialized corpus tailored specifically for power dispatch speech recognition, focusing on power dispatch-specific terminology and regional grid dispatch language. The acoustic model training utilizes deep neural networks as the fundamental framework. Inspired by the success of Transformers in large language models, we incorporate Transformers as the language model to further enhance prediction performance. The practical results highlight the superiority of the Transformer-based power dispatch speech recognition compared to traditional speech recognition frameworks. With an impressive accuracy in power dispatch speech recognition, the developed system based on this approach has been successfully deployed and validated in a regional grid control center, affirming its feasibility and effectiveness.

引用

页码：439 / 442

页数：4

共 50 条

[1] Language Model for Speech Recognition of Power Grid Dispatching Based on BERT
Chen L.
Zheng W.
Yu H.
Fu J.
Liu H.
Xia J.
Dianwang Jishu/Power System Technology, 2021, 45 (08): : 2955 - 2961
[2] A Language Model for Intelligent Speech Recognition of Power Dispatching
Zhao, Qing
Li, Tingrui
Luo, Rui
Li, Rui
Han, Tianyu
Han, Dongsheng
PROCEEDINGS OF ACM TURING AWARD CELEBRATION CONFERENCE, ACM TURC 2021, 2021, : 131 - 135
[3] LATENT DIRICHLIET LANGUAGE MODEL FOR SPEECH RECOGNITION
Chien, Jen-Tzung
Chueh, Chuang-Hua
2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 201 - 204
[4] Language Model Score Regularization for Speech Recognition
Zhang Yike
Zhang Pengyuan
Yan Yonghong
CHINESE JOURNAL OF ELECTRONICS, 2019, 28 (03) : 604 - 609
[5] Language Model Score Regularization for Speech Recognition
ZHANG Yike
ZHANG Pengyuan
YAN Yonghong
ChineseJournalofElectronics, 2019, 28 (03) : 604 - 609
[6] TOPIC CACHE LANGUAGE MODEL FOR SPEECH RECOGNITION
Chueh, Chuang-Hua
Chien, Jen-Tzung
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5194 - 5197
[7] LANGUAGE MODEL VERBALIZATION FOR AUTOMATIC SPEECH RECOGNITION
Sak, Hasim
Beaufays, Francoise
Nakajima, Kaisuke
Allauzen, Cyril
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8262 - 8266
[8] Topic tracking language model for speech recognition
Watanabe, Shinji
Iwata, Tomoharu
Hori, Takaaki
Sako, Atsushi
Ariki, Yasuo
COMPUTER SPEECH AND LANGUAGE, 2011, 25 (02) : 440 - 461
[9] Factored Language Model Adaptation Using Dirichlet Class Language Model for Speech Recognition
Hatami, Ali
Akbari, Ahmad
Nasersharif, Babak
2013 5TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2013, : 438 - 442
[10] STATISTICAL LANGUAGE MODEL ADAPTATION FOR ESTONIAN SPEECH RECOGNITION
Alumaee, Tanel
EESTI RAKENDUSLINGVISTIKA UHINGU AASTARAAMAT, 2008, 4 : 5 - 16

← 1 2 3 4 5 →