Speech Recognition Model Inspired on Large Language Model for Smart Grid Dispatching

Authors
Na, Qionglan [1]
Yang, Yixi [2]
Su, Dan [1]
Li, Xin [1]
Wang, Yifei [1]
Chen, Zhongtao [1]
Affiliations
[1] State Grid Jibei Informat & Telecommun Co, Beijing 100053, Peoples R China
[2] State Grid Informat & Telecommun Branch, Beijing 100761, Peoples R China
Source
PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON POWER ELECTRONICS AND ARTIFICIAL INTELLIGENCE, PEAI 2024 | 2024
Keywords
Grid; Speech recognition; Smart grid; Deep learning; FUTURE; CELLS;
DOI
10.1145/3674225.3674303
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In recent years, large language models have gained popularity across various domains, with particular attention given to the impressive performance of their core component, the Transformer. This paper aims to enhance the accuracy of intelligent power grid dispatch speech recognition by leveraging deep learning techniques, specifically CNN and Transformer architectures. The proposed approach involves the creation of a specialized corpus tailored specifically for power dispatch speech recognition, focusing on power dispatch-specific terminology and regional grid dispatch language. The acoustic model training utilizes deep neural networks as the fundamental framework. Inspired by the success of Transformers in large language models, we incorporate Transformers as the language model to further enhance prediction performance. The practical results highlight the superiority of the Transformer-based power dispatch speech recognition compared to traditional speech recognition frameworks. With an impressive accuracy in power dispatch speech recognition, the developed system based on this approach has been successfully deployed and validated in a regional grid control center, affirming its feasibility and effectiveness.
Pages: 439-442
Number of pages: 4