Speech Recognition Model Inspired on Large Language Model for Smart Grid Dispatching

被引:0
|
作者
Na, Qionglan [1 ]
Yang, Yixi [2 ]
Su, Dan [1 ]
Li, Xin [1 ]
Wang, Yifei [1 ]
Chen, Zhongtao [1 ]
机构
[1] State Grid Jibei Informat & Telecommun Co, Beijing 100053, Peoples R China
[2] State Grid Informat & Telecommun Branch, Beijing 100761, Peoples R China
来源
PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON POWER ELECTRONICS AND ARTIFICIAL INTELLIGENCE, PEAI 2024 | 2024年
关键词
Grid; Speech recognition; Smart grid; Deep learning; FUTURE; CELLS;
D O I
10.1145/3674225.3674303
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, large language models have gained popularity across various domains, with particular attention given to the impressive performance of their core component, the Transformer. This paper aims to enhance the accuracy of intelligent power grid dispatch speech recognition by leveraging deep learning techniques, specifically CNN and Transformer architectures. The proposed approach involves the creation of a specialized corpus tailored specifically for power dispatch speech recognition, focusing on power dispatch-specific terminology and regional grid dispatch language. The acoustic model training utilizes deep neural networks as the fundamental framework. Inspired by the success of Transformers in large language models, we incorporate Transformers as the language model to further enhance prediction performance. The practical results highlight the superiority of the Transformer-based power dispatch speech recognition compared to traditional speech recognition frameworks. With an impressive accuracy in power dispatch speech recognition, the developed system based on this approach has been successfully deployed and validated in a regional grid control center, affirming its feasibility and effectiveness.
引用
收藏
页码:439 / 442
页数:4
相关论文
共 50 条
  • [1] Language Model for Speech Recognition of Power Grid Dispatching Based on BERT
    Chen L.
    Zheng W.
    Yu H.
    Fu J.
    Liu H.
    Xia J.
    Dianwang Jishu/Power System Technology, 2021, 45 (08): : 2955 - 2961
  • [2] A Language Model for Intelligent Speech Recognition of Power Dispatching
    Zhao, Qing
    Li, Tingrui
    Luo, Rui
    Li, Rui
    Han, Tianyu
    Han, Dongsheng
    PROCEEDINGS OF ACM TURING AWARD CELEBRATION CONFERENCE, ACM TURC 2021, 2021, : 131 - 135
  • [3] LATENT DIRICHLIET LANGUAGE MODEL FOR SPEECH RECOGNITION
    Chien, Jen-Tzung
    Chueh, Chuang-Hua
    2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 201 - 204
  • [4] Language Model Score Regularization for Speech Recognition
    Zhang Yike
    Zhang Pengyuan
    Yan Yonghong
    CHINESE JOURNAL OF ELECTRONICS, 2019, 28 (03) : 604 - 609
  • [5] Language Model Score Regularization for Speech Recognition
    ZHANG Yike
    ZHANG Pengyuan
    YAN Yonghong
    ChineseJournalofElectronics, 2019, 28 (03) : 604 - 609
  • [6] TOPIC CACHE LANGUAGE MODEL FOR SPEECH RECOGNITION
    Chueh, Chuang-Hua
    Chien, Jen-Tzung
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5194 - 5197
  • [7] LANGUAGE MODEL VERBALIZATION FOR AUTOMATIC SPEECH RECOGNITION
    Sak, Hasim
    Beaufays, Francoise
    Nakajima, Kaisuke
    Allauzen, Cyril
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8262 - 8266
  • [8] Topic tracking language model for speech recognition
    Watanabe, Shinji
    Iwata, Tomoharu
    Hori, Takaaki
    Sako, Atsushi
    Ariki, Yasuo
    COMPUTER SPEECH AND LANGUAGE, 2011, 25 (02) : 440 - 461
  • [9] Factored Language Model Adaptation Using Dirichlet Class Language Model for Speech Recognition
    Hatami, Ali
    Akbari, Ahmad
    Nasersharif, Babak
    2013 5TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2013, : 438 - 442
  • [10] STATISTICAL LANGUAGE MODEL ADAPTATION FOR ESTONIAN SPEECH RECOGNITION
    Alumaee, Tanel
    EESTI RAKENDUSLINGVISTIKA UHINGU AASTARAAMAT, 2008, 4 : 5 - 16