Generation of suprasegmental information for speech using a recurrent neural network and binary gravitational search algorithm for feature selection

被引:8
|
作者
Sheikhan, Mansour [1 ]
机构
[1] Islamic Azad Univ, Fac Engn, Dept Elect Engn, South Tehran Branch, Tehran, Iran
关键词
Prosody generation; Recurrent neural network; Feature selection; Binary gravitational search algorithm; Binary particle swarm optimization; Modified MOS scale; PARTICLE SWARM OPTIMIZATION; GENETIC ALGORITHM; OF-SPEECH; PROSODY; SYSTEMS; CLASSIFIER; CONTOURS; EMOTION; CONTEXT; GSA;
D O I
10.1007/s10489-013-0505-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Suprasegmental (prosody) features of discourse provide a vehicle by which speakers reflect their mental purposes to listeners. Generating suitable prosody information is critical to expressing messages and improving the intelligibility and naturalness of synthetic speech. Generic prosody generators should provide information about pitch frequency (F (0)) contours, energy levels, word durations, and inter-word pause durations for speech synthesizers. The present study used a recurrent neural network (RNN) for prosody generation. The inputs of this RNN were word-level and syllable-level linguistic features. To provide data efficiently for the RNN-based prosody generator in the training, validation, and test phases, automatic segmentation and labeling of phonemes were performed. The number of inputs to the RNN was reduced by employing a binary gravitational search algorithm (BGSA) for feature selection (FS). The proposed prosody generator provided 12 output prosodic parameters for the current syllable for representing pitch contour, log-energy contour, inter-syllable pause duration, duration of syllable, duration of the vowel in the syllable, and vowel onset time. Experimental results demonstrated the success of the RNN-based prosody generator in synthesizing the six prosodic elements with acceptable root mean square error (RMSE). By using a BGSA-based FS unit, a lighter neural model was achieved with a 53 % reduction in the number of weight connections, producing RMSEs with acceptable degradation over the no-FS unit prosody generator. The performance of the BGSA-based FS method was compared with a binary particle swarm optimization (BPSO) algorithm, and the BGSA showed slightly better results. A modified mean opinion score scale was used to evaluate the intelligibility and naturalness of synthesized speech using the proposed method.
引用
收藏
页码:772 / 790
页数:19
相关论文
共 50 条
  • [21] A Fuzzy Classifier with Feature Selection Based on the Gravitational Search Algorithm
    Bardamova, Marina
    Konev, Anton
    Hodashinsky, Ilya
    Shelupanov, Alexander
    SYMMETRY-BASEL, 2018, 10 (11):
  • [22] Application of binary quantum-inspired gravitational search algorithm in feature subset selection
    Fatemeh Barani
    Mina Mirhosseini
    Hossein Nezamabadi-pour
    Applied Intelligence, 2017, 47 : 304 - 318
  • [23] Optimal feature selection in industrial foam injection processes using hybrid binary Particle Swarm Optimization and Gravitational Search Algorithm in the Mahalanobis–Taguchi System
    Edgar O. Reséndiz-Flores
    Jesús Alejandro Navarro-Acosta
    Agustín Hernández-Martínez
    Soft Computing, 2020, 24 : 341 - 349
  • [24] Feature selection using Binary Crow Search Algorithm with time varying flight length
    Chaudhuri, Abhilasha
    Sahu, Tirath Prasad
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 168
  • [25] Optimal feature selection in industrial foam injection processes using hybrid binary Particle Swarm Optimization and Gravitational Search Algorithm in the Mahalanobis-Taguchi System
    Resendiz-Flores, Edgar O.
    Alejandro Navarro-Acosta, Jesus
    Hernandez-Martinez, Agustin
    SOFT COMPUTING, 2020, 24 (01) : 341 - 349
  • [26] Efficient feature selection using one-pass generalized classifier neural network and binary bat algorithm with a novel fitness function
    Naik, Akshata K.
    Kuppili, Venkatanareshbabu
    Edla, Damodar Reddy
    SOFT COMPUTING, 2020, 24 (06) : 4575 - 4587
  • [27] Improving binary crow search algorithm for feature selection
    Alnaish, Zakaria A. Hamed A.
    Algamal, Zakariya Yahya
    JOURNAL OF INTELLIGENT SYSTEMS, 2023, 32 (01)
  • [28] Binary Owl Search Algorithm for Feature Subset Selection
    Mandal, Ashis Kumar
    Sen, Rikta
    Chakraborty, Basabi
    2019 IEEE 10TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST 2019), 2019, : 186 - 191
  • [29] BSSFS: binary sparrow search algorithm for feature selection
    Lin Sun
    Shanshan Si
    Weiping Ding
    Jiucheng Xu
    Yan Zhang
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 2633 - 2657
  • [30] Feature Selection for Microarray Data Classification Using Hybrid Information Gain and a Modified Binary Krill Herd Algorithm
    Zhang, Ge
    Hou, Jincui
    Wang, Jianlin
    Yan, Chaokun
    Luo, Junwei
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2020, 12 (03) : 288 - 301