Speech Recognition Model Inspired on Large Language Model for Smart Grid Dispatching

被引：0

作者：

Na, Qionglan ^{[1
]}

Yang, Yixi ^{[2
]}

Su, Dan ^{[1
]}

Li, Xin ^{[1
]}

Wang, Yifei ^{[1
]}

Chen, Zhongtao ^{[1
]}

机构：

[1] State Grid Jibei Informat & Telecommun Co, Beijing 100053, Peoples R China

[2] State Grid Informat & Telecommun Branch, Beijing 100761, Peoples R China

来源：

PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON POWER ELECTRONICS AND ARTIFICIAL INTELLIGENCE, PEAI 2024 | 2024年

关键词：

Grid; Speech recognition; Smart grid; Deep learning; FUTURE; CELLS;

D O I：

10.1145/3674225.3674303

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, large language models have gained popularity across various domains, with particular attention given to the impressive performance of their core component, the Transformer. This paper aims to enhance the accuracy of intelligent power grid dispatch speech recognition by leveraging deep learning techniques, specifically CNN and Transformer architectures. The proposed approach involves the creation of a specialized corpus tailored specifically for power dispatch speech recognition, focusing on power dispatch-specific terminology and regional grid dispatch language. The acoustic model training utilizes deep neural networks as the fundamental framework. Inspired by the success of Transformers in large language models, we incorporate Transformers as the language model to further enhance prediction performance. The practical results highlight the superiority of the Transformer-based power dispatch speech recognition compared to traditional speech recognition frameworks. With an impressive accuracy in power dispatch speech recognition, the developed system based on this approach has been successfully deployed and validated in a regional grid control center, affirming its feasibility and effectiveness.

引用

页码：439 / 442

页数：4

共 50 条

[31] A model for cloud-based large scale smart grid technologies
Aziz, S.
Joseph, Meera K.
Ferreira, H. C.
2017 1ST IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2017 17TH IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC / I&CPS EUROPE), 2017,
[32] Towards a Deep Speech Model for Romanian Language
Panaite, Marilena
Ruseti, Stefan
Dascalu, Mihai
Trausan-Matu, Stefan
2019 22ND INTERNATIONAL CONFERENCE ON CONTROL SYSTEMS AND COMPUTER SCIENCE (CSCS), 2019, : 416 - 419
[33] SPEECH RECOGNITION MODEL COMPRESSION
Sakthi, Madhumitha
Tewfik, Ahmed
Pawate, Raj
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7869 - 7873
[34] PROMPTING LARGE LANGUAGE MODELS WITH SPEECH RECOGNITION ABILITIES
Fathullah, Yassir
Wu, Chunyang
Lakomkin, Egor
Jia, Junteng
Shangguan, Yuan
Li, Ke
Guo, Jinxi
Xiong, Wenhan
Mahadeokar, Jay
Kalinli, Ozlem
Fuegen, Christian
Seltzer, Mike
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024, : 13351 - 13355
[35] Correction while Recognition: Combining Pretrained Language Model for Taiwan-Accented Speech Recognition
Li, Sheng
Li, Jiyi
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 389 - 400
[36] Potential of Smart grid in Thailand: a Development of WADE Smart Grid Model
Pisanupoj, Songkran
Ongsakul, Weerakorn
Singh, Jai Govind
PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE & UTILITY EXHIBITION ON GREEN ENERGY FOR SUSTAINABLE DEVELOPMENT (ICUE), 2014,
[37] Resilient Hybrid Overlay Model for Smart Grid: RHM for Smart Grid
Kher, Shubhalaxmi
Nutt, Victor
Dasgupta, Dipankar
2013 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN CYBER SECURITY (CICS), 2013, : 45 - 51
[38] MEASURING INFORMATION PROVIDED BY LANGUAGE MODEL AND ACOUSTIC MODEL IN PROBABILISTIC SPEECH RECOGNITION - THEORY AND EXPERIMENTAL RESULTS
FERRETTI, M
MALTESE, G
SCARCI, S
SPEECH COMMUNICATION, 1990, 9 (5-6) : 531 - 539
[39] TOPIC N-GRAM COUNT LANGUAGE MODEL ADAPTATION FOR SPEECH RECOGNITION
Haidar, Md. Akmal
O'Shaughnessy, Douglas
2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 165 - 169
[40] Dynamic out-of-vocabulary word registration to language model for speech recognition
Kitaoka, Norihide
Chen, Bohan
Obashi, Yuya
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)

← 1 2 3 4 5 →