Generation of suprasegmental information for speech using a recurrent neural network and binary gravitational search algorithm for feature selection

Cited by: 8
Authors
Sheikhan, Mansour [1]
Affiliations
[1] Islamic Azad Univ, Fac Engn, Dept Elect Engn, South Tehran Branch, Tehran, Iran
Keywords
Prosody generation; Recurrent neural network; Feature selection; Binary gravitational search algorithm; Binary particle swarm optimization; Modified MOS scale; PARTICLE SWARM OPTIMIZATION; GENETIC ALGORITHM; OF-SPEECH; PROSODY; SYSTEMS; CLASSIFIER; CONTOURS; EMOTION; CONTEXT; GSA;
DOI
10.1007/s10489-013-0505-x
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Suprasegmental (prosody) features of discourse provide a vehicle by which speakers reflect their mental purposes to listeners. Generating suitable prosody information is critical to expressing messages and improving the intelligibility and naturalness of synthetic speech. Generic prosody generators should provide information about pitch frequency (F0) contours, energy levels, word durations, and inter-word pause durations for speech synthesizers. The present study used a recurrent neural network (RNN) for prosody generation. The inputs of this RNN were word-level and syllable-level linguistic features. To provide data efficiently for the RNN-based prosody generator in the training, validation, and test phases, automatic segmentation and labeling of phonemes were performed. The number of inputs to the RNN was reduced by employing a binary gravitational search algorithm (BGSA) for feature selection (FS). The proposed prosody generator provided 12 output prosodic parameters for the current syllable for representing pitch contour, log-energy contour, inter-syllable pause duration, duration of syllable, duration of the vowel in the syllable, and vowel onset time. Experimental results demonstrated the success of the RNN-based prosody generator in synthesizing the six prosodic elements with acceptable root mean square error (RMSE). By using a BGSA-based FS unit, a lighter neural model was achieved with a 53% reduction in the number of weight connections, producing RMSEs with acceptable degradation over the no-FS unit prosody generator. The performance of the BGSA-based FS method was compared with a binary particle swarm optimization (BPSO) algorithm, and the BGSA showed slightly better results. A modified mean opinion score scale was used to evaluate the intelligibility and naturalness of synthesized speech using the proposed method.
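The abstract names a binary gravitational search algorithm for feature selection but gives no implementation details. The block below is a minimal sketch of a generic BGSA wrapper, assuming the commonly described formulation (fitness-derived masses, Hamming distance between agents, a linearly decaying gravitational constant, and a |tanh(v)| bit-flip probability). The function name `bgsa_select`, all default parameter values, and the `toy_cost` objective are hypothetical and chosen for illustration only; in the study itself the quantity being minimized would presumably depend on the RMSE of the RNN trained on the selected inputs, which is not reproduced here.

```python
import math
import random


def bgsa_select(cost, n_features, n_agents=20, n_iters=60, g0=1.0, v_max=6.0, seed=0):
    """Minimise cost(mask) over 0/1 feature masks with a simplified BGSA (illustrative)."""
    rng = random.Random(seed)
    pos = [[rng.randint(0, 1) for _ in range(n_features)] for _ in range(n_agents)]
    vel = [[0.0] * n_features for _ in range(n_agents)]
    best_mask, best_cost = pos[0][:], cost(pos[0])

    for t in range(n_iters):
        costs = [cost(p) for p in pos]
        for p, c in zip(pos, costs):
            if c < best_cost:
                best_mask, best_cost = p[:], c

        # Lower cost gives a heavier agent; masses are normalised to sum to 1.
        lo, hi = min(costs), max(costs)
        raw = [1.0 if hi == lo else (hi - c) / (hi - lo) for c in costs]
        total = sum(raw)
        mass = [r / total for r in raw]

        # Gravitational constant and number of attracting agents both shrink over time.
        g = g0 * (1.0 - t / n_iters)
        k = max(1, round(n_agents * (1.0 - t / n_iters)))
        k_best = sorted(range(n_agents), key=lambda i: costs[i])[:k]

        new_pos = []
        for i in range(n_agents):
            # Hamming distance from agent i to each attracting agent.
            dist = {j: sum(a != b for a, b in zip(pos[i], pos[j])) for j in k_best}
            row = pos[i][:]
            for d in range(n_features):
                acc = 0.0
                for j in k_best:
                    if j != i:
                        pull = pos[j][d] - pos[i][d]
                        acc += rng.random() * g * mass[j] * pull / (dist[j] + 1e-9)
                v = rng.random() * vel[i][d] + acc
                v = max(-v_max, min(v_max, v))  # clip the velocity
                vel[i][d] = v
                # A large |v| means a high probability of flipping this bit.
                if rng.random() < abs(math.tanh(v)):
                    row[d] = 1 - row[d]
            new_pos.append(row)
        pos = new_pos

    # Evaluate the positions produced by the final move.
    for p in pos:
        c = cost(p)
        if c < best_cost:
            best_mask, best_cost = p[:], c
    return best_mask, best_cost


RELEVANT = {0, 3, 5}  # hypothetical "useful" feature indices for the toy objective


def toy_cost(mask):
    """Toy stand-in for a wrapper objective: penalise missing useful features and mask size."""
    missing = sum(1 for i in RELEVANT if not mask[i])
    return missing + 0.1 * sum(mask)


if __name__ == "__main__":
    mask, value = bgsa_select(toy_cost, n_features=8)
    print(mask, value)
```

The size penalty in `toy_cost` mirrors the trade-off described in the abstract, where fewer inputs yield a lighter network at the price of some RMSE degradation; the weighting of 0.1 is an arbitrary assumption.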
Pages: 772-790
Page count: 19
Related papers
50 in total
  • [41] Effect on speech emotion classification of a feature selection approach using a convolutional neural network
    Amjad, Ammar
    Khan, Lal
    Chang, Hsien-Tsung
    PEERJ COMPUTER SCIENCE, 2021, 7
  • [42] Integration of Kestrel-based search algorithm with artificial neural network for feature subset selection
    Agbehadji, Israel Edem
    Millham, Richard C.
    Fong, Simon James
    Yang, Hongji
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2019, 13 (04) : 222 - 233
  • [43] Genetic Algorithm-Neural Network (GANN): a study of neural network activation functions and depth of genetic algorithm search applied to feature selection
    Dong Ling Tong
    Robert Mintram
    International Journal of Machine Learning and Cybernetics, 2010, 1 : 75 - 87
  • [44] Global Best Guided Binary Crow Search Algorithm for Feature Selection
    Agarwal, Unnati
    Sahu, Tirath Prasad
    DISTRIBUTED COMPUTING AND OPTIMIZATION TECHNIQUES, ICDCOT 2021, 2022, 903 : 481 - 491
  • [46] Binary Ebola Optimization Search Algorithm for Feature Selection and Classification Problems
    Akinola, Olatunji
    Oyelade, Olaide N.
    Ezugwu, Absalom E.
    APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [47] A V-Shaped Binary Crow Search Algorithm for Feature Selection
    Thom de Souza, Rodrigo Clemente
    de Macedo, Camila Andrade
    Coelho, Leandro dos Santos
    Pierezan, Juliano
    2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018 : 157 - 164
  • [48] An Improved Binary Cuckoo Search Algorithm For Feature Selection Using Filter Method And Chaotic Map
    Feizi-Derakhsh, Mohammad-Reza
    Kadhim, Estabraq Abdulredaa
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2022, 26 (06) : 897 - 903
  • [49] Wrapper-based Feature Selection for Imbalanced Data using Binary Queuing Search Algorithm
    Thaher, Thaer
    Mafarja, Majdi
    Abdalhaq, Baker
    Chantar, Hamouda
    2019 2ND INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2019 : 318 - 323
  • [50] A hybrid Gravitational Search Algorithm-Genetic Algorithm for neural network training
    Sheikhpour, Saeide
    Sabouri, Mahdieh
    Zahiri, Seyed-Hamid
    2013 21ST IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2013