Speech Recognition Model Inspired on Large Language Model for Smart Grid Dispatching

被引：0

作者：

Na, Qionglan ^{[1
]}

Yang, Yixi ^{[2
]}

Su, Dan ^{[1
]}

Li, Xin ^{[1
]}

Wang, Yifei ^{[1
]}

Chen, Zhongtao ^{[1
]}

机构：

[1] State Grid Jibei Informat & Telecommun Co, Beijing 100053, Peoples R China

[2] State Grid Informat & Telecommun Branch, Beijing 100761, Peoples R China

来源：

PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON POWER ELECTRONICS AND ARTIFICIAL INTELLIGENCE, PEAI 2024 | 2024年

关键词：

Grid; Speech recognition; Smart grid; Deep learning; FUTURE; CELLS;

D O I：

10.1145/3674225.3674303

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, large language models have gained popularity across various domains, with particular attention given to the impressive performance of their core component, the Transformer. This paper aims to enhance the accuracy of intelligent power grid dispatch speech recognition by leveraging deep learning techniques, specifically CNN and Transformer architectures. The proposed approach involves the creation of a specialized corpus tailored specifically for power dispatch speech recognition, focusing on power dispatch-specific terminology and regional grid dispatch language. The acoustic model training utilizes deep neural networks as the fundamental framework. Inspired by the success of Transformers in large language models, we incorporate Transformers as the language model to further enhance prediction performance. The practical results highlight the superiority of the Transformer-based power dispatch speech recognition compared to traditional speech recognition frameworks. With an impressive accuracy in power dispatch speech recognition, the developed system based on this approach has been successfully deployed and validated in a regional grid control center, affirming its feasibility and effectiveness.

引用

页码：439 / 442

页数：4

共 50 条

[21] A smart dispatching business maturity model of dispatching center and its application
Liu, Kaicheng
He, Guangyu
Wang, Bin
Liu, Feng
Gu, Zhidong
Huang, Liangyi
Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2012, 36 (13): : 1 - 7
[22] Combination of Random Indexing based Language Model and N-gram Language Model for Speech Recognition
Fohr, Dominique
Mella, Odile
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2231 - 2235
[23] Recurrent Neural Network Language Model with Part-of-speech for Mandarin Speech Recognition
Gong, Caixia
Li, Xiangang
Wu, Xihong
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 459 - 463
[24] DOCUMENT-SPECIFIC CONTEXT PLSA LANGUAGE MODEL FOR SPEECH RECOGNITION
Haidar, Md Akmal
O'Shaughnessy, Douglas
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5326 - 5330
[25] Document filtering based on spectral clustering for speech recognition language model
Takahashi, Shinya
Morimoto, Tsuyoshi
Tsuruta, Naoyuki
IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 393 - +
[26] Efficient Language Model Adaptation for Automatic Speech Recognition of Spoken Translations
Pelemans, Joris
Vanallemeersch, Tom
Demuynck, Kris
Van Hamme, Hugo
Wambacq, Patrick
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2262 - 2266
[27] A study of speech recognition based on RNN-RBM language model
Li, Yaxiong, 1936, Science Press (51): : 1936 - 1944
[28] Research on Syllable-Based Language Model in Malay Speech Recognition
Wei, Xiangfeng
Zhang, Quan
Yuan, Yi
2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 150 - 155
[29] Speech Recognition Model for Assamese Language Using Deep Neural Network
Singh, Moirangthem Tiken
Barman, Partha Pratim
Gogoi, Rupjyoti
2018 INTERNATIONAL CONFERENCE ON RECENT INNOVATIONS IN ELECTRICAL, ELECTRONICS & COMMUNICATION ENGINEERING (ICRIEECE 2018), 2018, : 2722 - 2727
[30] Speech recognition model design for Sundanese language using WAV2VEC 2.0
Cryssiover A.
Zahra A.
International Journal of Speech Technology, 2024, 27 (01) : 171 - 177

← 1 2 3 4 5 →