Generation of suprasegmental information for speech using a recurrent neural network and binary gravitational search algorithm for feature selection

被引:8
|
作者
Sheikhan, Mansour [1 ]
机构
[1] Islamic Azad Univ, Fac Engn, Dept Elect Engn, South Tehran Branch, Tehran, Iran
关键词
Prosody generation; Recurrent neural network; Feature selection; Binary gravitational search algorithm; Binary particle swarm optimization; Modified MOS scale; PARTICLE SWARM OPTIMIZATION; GENETIC ALGORITHM; OF-SPEECH; PROSODY; SYSTEMS; CLASSIFIER; CONTOURS; EMOTION; CONTEXT; GSA;
D O I
10.1007/s10489-013-0505-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Suprasegmental (prosody) features of discourse provide a vehicle by which speakers reflect their mental purposes to listeners. Generating suitable prosody information is critical to expressing messages and improving the intelligibility and naturalness of synthetic speech. Generic prosody generators should provide information about pitch frequency (F (0)) contours, energy levels, word durations, and inter-word pause durations for speech synthesizers. The present study used a recurrent neural network (RNN) for prosody generation. The inputs of this RNN were word-level and syllable-level linguistic features. To provide data efficiently for the RNN-based prosody generator in the training, validation, and test phases, automatic segmentation and labeling of phonemes were performed. The number of inputs to the RNN was reduced by employing a binary gravitational search algorithm (BGSA) for feature selection (FS). The proposed prosody generator provided 12 output prosodic parameters for the current syllable for representing pitch contour, log-energy contour, inter-syllable pause duration, duration of syllable, duration of the vowel in the syllable, and vowel onset time. Experimental results demonstrated the success of the RNN-based prosody generator in synthesizing the six prosodic elements with acceptable root mean square error (RMSE). By using a BGSA-based FS unit, a lighter neural model was achieved with a 53 % reduction in the number of weight connections, producing RMSEs with acceptable degradation over the no-FS unit prosody generator. The performance of the BGSA-based FS method was compared with a binary particle swarm optimization (BPSO) algorithm, and the BGSA showed slightly better results. A modified mean opinion score scale was used to evaluate the intelligibility and naturalness of synthesized speech using the proposed method.
引用
收藏
页码:772 / 790
页数:19
相关论文
共 50 条
  • [31] Optimum Network Selection in Heterogeneous wireless environment using Gravitational Search Algorithm
    Kumari, P. Aruna
    Prabha, I. Santi
    2015 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION ENGINEERING SYSTEMS (SPACES), 2015, : 464 - 467
  • [32] Speech Emotion Feature Selection Method Based on Contribution Analysis Algorithm of Neural Network
    Wang, Xiaojia
    Mao, Qirong
    Zhan, Yongzhao
    INTERNATIONAL ELECTRONIC CONFERENCE ON COMPUTER SCIENCE, 2008, 1060 : 336 - 339
  • [33] Improved Binary Symbiotic Organism Search Algorithm With Transfer Functions for Feature Selection
    Du, Zhi-Gang
    Pan, Jeng-Shyang
    Chu, Shu-Chuan
    Chiu, Yi-Jui
    IEEE ACCESS, 2020, 8 : 225730 - 225744
  • [34] A Novel Extended Binary Cuckoo Search Algorithm for Feature Selection
    Salesi, Sadegh
    Cosma, Georgina
    PROCEEDINGS OF 2017 2ND INTERNATIONAL CONFERENCE ON KNOWLEDGE ENGINEERING AND APPLICATIONS (ICKEA), 2017, : 6 - 12
  • [35] Binary Symbiotic Organism Search Algorithm for Feature Selection and Analysis
    Han, Cao
    Zhou, Guo
    Zhou, Yongquan
    IEEE ACCESS, 2019, 7 : 166833 - 166859
  • [36] Evolving an Adaptive Artificial Neural Network with a Gravitational Search Algorithm
    Tan, Shing Chiang
    Lim, Chee Peng
    INTELLIGENT DECISION TECHNOLOGIES, 2015, 39 : 599 - 609
  • [37] Comparison of Using the Genetic Algorithm and Cuckoo Search for Feature Selection
    Kaya, Yasin
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,
  • [38] A Novel Feature Selection Method on Mutual Information and Improved Gravitational Search Algorithm for High Dimensional Biomedical Data
    Yan, Chaokun
    Kang, Xi
    Li, Mengyuan
    Wang, Jianlin
    2021 THE 13TH INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2021), 2021, : 24 - 30
  • [39] An Improved Binary Whale Optimization Algorithm for Feature Selection of Network Intrusion Detection
    Xu, Hui
    Fu, Yingchun
    Fang, Ce
    Cao, Qianqian
    Su, Jun
    Wei, Siwei
    PROCEEDINGS OF THE 2018 IEEE 4TH INTERNATIONAL SYMPOSIUM ON WIRELESS SYSTEMS WITHIN THE INTERNATIONAL CONFERENCES ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS (IDAACS-SWS), 2018, : 10 - 15
  • [40] Genetic Algorithm-Neural Network (GANN): a study of neural network activation functions and depth of genetic algorithm search applied to feature selection
    Tong, Dong Ling
    Mintram, Robert
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2010, 1 (1-4) : 75 - 87