SINGAN: Singing Voice Conversion with Generative Adversarial Networks

被引:0
|
作者
Sisman, Berrak [1 ,2 ]
Vijayan, Karthika [1 ]
Dong, Minghui [2 ]
Li, Haizhou [1 ]
机构
[1] Natl Univ Singapore, Singapore, Singapore
[2] ASTAR, Inst Infocomm Res, Singapore, Singapore
基金
新加坡国家研究基金会;
关键词
Singing voice conversion; generative adversarial networks; singing voice;
D O I
10.1109/apsipaasc47483.2019.9023162
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Singing voice conversion (SVC) is a task to convert the source singer's voice to sound like that of the target singer, without changing the lyrical content. So far, most of the voice conversion studies mainly focus only on the speech voice conversion that is different from singing voice conversion. We note that singing conveys both lexical and emotional information through words and tones. It is one of the most expressive components in music and a means of entertainment as well as self expression. In this paper, we propose a novel singing voice conversion framework, that is based on Generative Adversarial Networks (GANs). The proposed CAN-based conversion framework, that we call SINGAN, consists of two neural networks: a discriminator to distinguish natural and converted singing voice, and a generator to deceive the discriminator. With CAN, we minimize the differences of the distributions between the original target parameters and the generated singing parameters. To our best knowledge, this is the first framework that uses generative adversarial networks for singing voice conversion. In experiments, we show that the proposed method effectively converts singing voices and outperforms the baseline approach.
引用
收藏
页码:112 / 118
页数:7
相关论文
共 50 条
  • [41] Image Style Conversion Model Design Based on Generative Adversarial Networks
    Gong, Ke
    Zhen, Zhu
    IEEE ACCESS, 2024, 12 : 122126 - 122138
  • [42] Mandarin Singing Synthesis Based on Generative Adversarial Network
    Zhou, Yun
    Yang, Hongwu
    Chen, Ziyan
    Yan, Yajing
    2020 IEEE 3RD INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP 2020), 2020, : 139 - 142
  • [43] Minimum acceptance criteria for subsurface scenario-based uncertainty models from single image generative adversarial networks (SinGAN)
    Liu, Lei
    Salazar, Jose J.
    Jo, Honggeun
    Prodanovic, Masa
    Pyrcz, Michael J.
    COMPUTATIONAL GEOSCIENCES, 2025, 29 (01)
  • [44] STARGAN-VC: NON-PARALLEL MANY-TO-MANY VOICE CONVERSION USING STAR GENERATIVE ADVERSARIAL NETWORKS
    Kameoka, Hirokazu
    Kaneko, Takuhiro
    Tanaka, Kou
    Hojo, Nobukatsu
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 266 - 273
  • [45] Neutral-to-emotional voice conversion with cross-wavelet transform F0 using generative adversarial networks
    Luo, Zhaojie
    Chen, Jinhui
    Takiguchi, Tetsuya
    Ariki, Yasuo
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2019, 8 : 1 - 11
  • [46] Exploring generative adversarial networks and adversarial training
    Sajeeda A.
    Hossain B.M.M.
    Int. J. Cogn. Comp. Eng., (78-89): : 78 - 89
  • [47] An Adaptive-Learning-Based Generative Adversarial Network for One-to-One Voice Conversion
    Dhar S.
    Jana N.D.
    Das S.
    IEEE Transactions on Artificial Intelligence, 2023, 4 (01): : 92 - 106
  • [48] Interpretable Generative Adversarial Networks
    Li, Chao
    Yao, Kelu
    Wang, Jin
    Diao, Boyu
    Xu, Yongjun
    Zhang, Quanshi
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1280 - 1288
  • [49] Steganographic Generative Adversarial Networks
    Volkhonskiy, Denis
    Nazarov, Ivan
    Burnaev, Evgeny
    TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [50] Wasserstein Generative Adversarial Networks
    Arjovsky, Martin
    Chintala, Soumith
    Bottou, Leon
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70