Application of Three Different Artificial Neural Network Architectures for Voice Conversion

被引:0
|
作者
Sathe-Pathak, Bageshree [1 ]
Patil, Shalaka [2 ]
Panat, Ashish [1 ]
机构
[1] Priyadarshani Coll Engn, Nagpur, Maharashtra, India
[2] Cummins Coll Engn, Pune, Maharashtra, India
来源
INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 2, INDIA 2016 | 2016年 / 434卷
关键词
Artificial neural network; Discrete wavelet transform; Packet decomposition; Spectral transformation; Speech transformation; ALGORITHM;
D O I
10.1007/978-81-322-2752-6_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper designs a Multi-scale Spectral transformation technique for Voice Conversion. The proposed algorithm uses Spectral transformation technique designed using multi-resolution wavelet feature set and a Neural Network to generate a mapping function between source and target speech. Dynamic Frequency Warping technique is used for aligning source and target speech and Overlap-Add method is used for minimizing the distortions that occur in the reconstruction process. With the use of Neural Network, mapping of spectral parameters between source and target speech has been achieved more efficiently. In this paper, the mapping function is generated in three different ways, using three types of Neural Networks namely, Feed Forward Neural Network, Generalized Regression Neural Network and Radial Basis Neural Network. Results of all three Neural Networks are compared using execution time requirements and Subjective analysis. The main advantage of this approach is that it is speech as well as speaker independent algorithm.
引用
收藏
页码:237 / 246
页数:10
相关论文
共 50 条
  • [1] A Probabilistic Interpretation for Artificial Neural Network-based Voice Conversion
    Hwang, Hsin-Te
    Tsao, Yu
    Wang, Hsin-Min
    Wang, Yih-Ru
    Chen, Sin-Horng
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 552 - 558
  • [2] Versatile Architectures of Artificial Neural Network with Variable Capacity
    Basiri, M. Mohamed Asan
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (11) : 6333 - 6353
  • [3] Versatile Architectures of Artificial Neural Network with Variable Capacity
    M. Mohamed Asan Basiri
    Circuits, Systems, and Signal Processing, 2022, 41 : 6333 - 6353
  • [4] Conversion of one Emotional state to other of a Speech Signal using Artificial Neural Network
    Sathe-Pathak, Bageshree
    Patil, Shalaka
    Panat, Ashish
    2015 1ST INTERNATIONAL CONFERENCE ON NEXT GENERATION COMPUTING TECHNOLOGIES (NGCT), 2015, : 831 - 835
  • [5] Implementation of Artificial Neural Network and Multilevel of Discrete Wavelet Transform for Voice Recognition
    Suksiri, Bandhit
    Fukumoto, Masahiro
    COMPUTER AND INFORMATION SCIENCE, 2016, 656 : 15 - 26
  • [6] Application of artificial neural network to simultaneous spectrophotometric determination of three components dyestuff
    Lin, SL
    Xie, CS
    Wang, JD
    Chen, ZR
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2003, 23 (06) : 1135 - 1138
  • [7] Development and Application of Artificial Neural Network
    Wu, Yu-chen
    Feng, Jun-wen
    WIRELESS PERSONAL COMMUNICATIONS, 2018, 102 (02) : 1645 - 1656
  • [8] Development and Application of Artificial Neural Network
    Yu-chen Wu
    Jun-wen Feng
    Wireless Personal Communications, 2018, 102 : 1645 - 1656
  • [9] Application of Artificial Neural Network for the Prediction of Copper Ore Grade
    Tsae, Ntshiri Batlile
    Adachi, Tsuyoshi
    Kawamura, Youhei
    MINERALS, 2023, 13 (05)
  • [10] APPLICATION OF ARTIFICIAL NEURAL NETWORK IN THE EFFICIENT CONTROL OF THREE-PHASE INDUCTION MOTOR
    Santos, Arineu F.
    Neves, Francisco A. S.
    Aquino, Ronaldo R. B.
    Cavalcanti, Marcelo C.
    2009 BRAZILIAN POWER ELECTRONICS CONFERENCE, VOLS 1 AND 2, 2009, : 292 - 299