Generation of suprasegmental information for speech using a recurrent neural network and binary gravitational search algorithm for feature selection

被引:8
|
作者
Sheikhan, Mansour [1 ]
机构
[1] Islamic Azad Univ, Fac Engn, Dept Elect Engn, South Tehran Branch, Tehran, Iran
关键词
Prosody generation; Recurrent neural network; Feature selection; Binary gravitational search algorithm; Binary particle swarm optimization; Modified MOS scale; PARTICLE SWARM OPTIMIZATION; GENETIC ALGORITHM; OF-SPEECH; PROSODY; SYSTEMS; CLASSIFIER; CONTOURS; EMOTION; CONTEXT; GSA;
D O I
10.1007/s10489-013-0505-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Suprasegmental (prosody) features of discourse provide a vehicle by which speakers reflect their mental purposes to listeners. Generating suitable prosody information is critical to expressing messages and improving the intelligibility and naturalness of synthetic speech. Generic prosody generators should provide information about pitch frequency (F (0)) contours, energy levels, word durations, and inter-word pause durations for speech synthesizers. The present study used a recurrent neural network (RNN) for prosody generation. The inputs of this RNN were word-level and syllable-level linguistic features. To provide data efficiently for the RNN-based prosody generator in the training, validation, and test phases, automatic segmentation and labeling of phonemes were performed. The number of inputs to the RNN was reduced by employing a binary gravitational search algorithm (BGSA) for feature selection (FS). The proposed prosody generator provided 12 output prosodic parameters for the current syllable for representing pitch contour, log-energy contour, inter-syllable pause duration, duration of syllable, duration of the vowel in the syllable, and vowel onset time. Experimental results demonstrated the success of the RNN-based prosody generator in synthesizing the six prosodic elements with acceptable root mean square error (RMSE). By using a BGSA-based FS unit, a lighter neural model was achieved with a 53 % reduction in the number of weight connections, producing RMSEs with acceptable degradation over the no-FS unit prosody generator. The performance of the BGSA-based FS method was compared with a binary particle swarm optimization (BPSO) algorithm, and the BGSA showed slightly better results. A modified mean opinion score scale was used to evaluate the intelligibility and naturalness of synthesized speech using the proposed method.
引用
收藏
页码:772 / 790
页数:19
相关论文
共 50 条
  • [1] Generation of suprasegmental information for speech using a recurrent neural network and binary gravitational search algorithm for feature selection
    Mansour Sheikhan
    Applied Intelligence, 2014, 40 : 772 - 790
  • [2] Feature subset selection using improved binary gravitational search algorithm
    Rashedi, Esmat
    Nezamabadi-pour, Hossein
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2014, 26 (03) : 1211 - 1221
  • [3] Hybrid of binary gravitational search algorithm and mutual information for feature selection in intrusion detection systems
    Bostani, Hamid
    Sheikhan, Mansour
    SOFT COMPUTING, 2017, 21 (09) : 2307 - 2324
  • [4] Hybrid of binary gravitational search algorithm and mutual information for feature selection in intrusion detection systems
    Hamid Bostani
    Mansour Sheikhan
    Soft Computing, 2017, 21 : 2307 - 2324
  • [5] Feature Subset Selection Using Binary Gravitational Search Algorithm for Intrusion Detection System
    Behjat, Amir Rajabi
    Mustapha, Aida
    Nezamabadi-pour, Hossein
    Sulaiman, Md. Nasir
    Mustapha, Norwati
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT II, 2013, 7803 : 377 - 386
  • [6] Feature Selection Using an Improved Gravitational Search Algorithm
    Zhu, Lei
    He, Shoushuai
    Wang, Lei
    Zeng, Weijun
    Yang, Jian
    IEEE ACCESS, 2019, 7 : 114440 - 114448
  • [7] Introducing clustering based population in Binary Gravitational Search Algorithm for Feature Selection
    Guha, Ritam
    Ghosh, Manosij
    Chakrabarti, Akash
    Sarkar, Ram
    Mirjalili, Seyedali
    APPLIED SOFT COMPUTING, 2020, 93
  • [8] Feature Selection with a Binary Flamingo Search Algorithm and a Genetic Algorithm
    Eluri, Rama Krishna
    Devarakonda, Nagaraju
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (17) : 26679 - 26730
  • [9] Application of binary quantum-inspired gravitational search algorithm in feature subset selection
    Barani, Fatemeh
    Mirhosseini, Mina
    Nezamabadi-pour, Hossein
    APPLIED INTELLIGENCE, 2017, 47 (02) : 304 - 318
  • [10] Synthesizing Suprasegmental Speech Information Using Hybrid of GA-ACO and Dynamic Neural Network
    Sheikhan, Mansour
    2013 5TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2013, : 175 - 180