Application of Three Different Artificial Neural Network Architectures for Voice Conversion

被引：0

作者：

Sathe-Pathak, Bageshree ^{[1
]}

Patil, Shalaka ^{[2
]}

Panat, Ashish ^{[1
]}

机构：

[1] Priyadarshani Coll Engn, Nagpur, Maharashtra, India

[2] Cummins Coll Engn, Pune, Maharashtra, India

来源：

INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 2, INDIA 2016 | 2016年 / 434卷

关键词：

Artificial neural network; Discrete wavelet transform; Packet decomposition; Spectral transformation; Speech transformation; ALGORITHM;

D O I：

10.1007/978-81-322-2752-6_23

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper designs a Multi-scale Spectral transformation technique for Voice Conversion. The proposed algorithm uses Spectral transformation technique designed using multi-resolution wavelet feature set and a Neural Network to generate a mapping function between source and target speech. Dynamic Frequency Warping technique is used for aligning source and target speech and Overlap-Add method is used for minimizing the distortions that occur in the reconstruction process. With the use of Neural Network, mapping of spectral parameters between source and target speech has been achieved more efficiently. In this paper, the mapping function is generated in three different ways, using three types of Neural Networks namely, Feed Forward Neural Network, Generalized Regression Neural Network and Radial Basis Neural Network. Results of all three Neural Networks are compared using execution time requirements and Subjective analysis. The main advantage of this approach is that it is speech as well as speaker independent algorithm.

引用

页码：237 / 246

页数：10

共 50 条

[1] A Probabilistic Interpretation for Artificial Neural Network-based Voice Conversion
Hwang, Hsin-Te
Tsao, Yu
Wang, Hsin-Min
Wang, Yih-Ru
Chen, Sin-Horng
2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 552 - 558
[2] Versatile Architectures of Artificial Neural Network with Variable Capacity
Basiri, M. Mohamed Asan
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (11) : 6333 - 6353
[3] Versatile Architectures of Artificial Neural Network with Variable Capacity
M. Mohamed Asan Basiri
Circuits, Systems, and Signal Processing, 2022, 41 : 6333 - 6353
[4] Conversion of one Emotional state to other of a Speech Signal using Artificial Neural Network
Sathe-Pathak, Bageshree
Patil, Shalaka
Panat, Ashish
2015 1ST INTERNATIONAL CONFERENCE ON NEXT GENERATION COMPUTING TECHNOLOGIES (NGCT), 2015, : 831 - 835
[5] Implementation of Artificial Neural Network and Multilevel of Discrete Wavelet Transform for Voice Recognition
Suksiri, Bandhit
Fukumoto, Masahiro
COMPUTER AND INFORMATION SCIENCE, 2016, 656 : 15 - 26
[6] Application of artificial neural network to simultaneous spectrophotometric determination of three components dyestuff
Lin, SL
Xie, CS
Wang, JD
Chen, ZR
SPECTROSCOPY AND SPECTRAL ANALYSIS, 2003, 23 (06) : 1135 - 1138
[7] Development and Application of Artificial Neural Network
Wu, Yu-chen
Feng, Jun-wen
WIRELESS PERSONAL COMMUNICATIONS, 2018, 102 (02) : 1645 - 1656
[8] Development and Application of Artificial Neural Network
Yu-chen Wu
Jun-wen Feng
Wireless Personal Communications, 2018, 102 : 1645 - 1656
[9] Application of Artificial Neural Network for the Prediction of Copper Ore Grade
Tsae, Ntshiri Batlile
Adachi, Tsuyoshi
Kawamura, Youhei
MINERALS, 2023, 13 (05)
[10] APPLICATION OF ARTIFICIAL NEURAL NETWORK IN THE EFFICIENT CONTROL OF THREE-PHASE INDUCTION MOTOR
Santos, Arineu F.
Neves, Francisco A. S.
Aquino, Ronaldo R. B.
Cavalcanti, Marcelo C.
2009 BRAZILIAN POWER ELECTRONICS CONFERENCE, VOLS 1 AND 2, 2009, : 292 - 299

← 1 2 3 4 5 →