AUTOMATIC SINGING EVALUATION WITHOUT REFERENCE MELODY USING BI-DENSE NEURAL NETWORK

被引:0
作者
Zhang, Ning [1 ]
Jiang, Tao [1 ]
Deng, Feng [1 ]
Li, Yan [1 ]
机构
[1] Kuaishou Technol Co, Beijing, Peoples R China
来源
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019年
关键词
singing evaluation; Bi-DenseNet;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Automatic singing evaluation without reference melody has long been a difficult problem. This paper aims to pilot a novel data driven approach to tackle this artistic problem. We constructed a large scale dataset and designed an innovative Bi-Dense neural network which can address this task efficiently. Though the singing evaluation is quite a subjective task and depends a lot on listeners' preferences, we showed that a specific group has consistency on the singing evaluations, and it is possible to train a model to learn the subjective preferences of this group. In this paper, a large amount of singing clips and corresponding human gradings were collected. And an elaborate designed Bi-DenseNet was trained to discriminate the good singings from the poor ones. The experiments demonstrated the proposed network performs better than the existing networks for singing evaluation task.
引用
收藏
页码:466 / 470
页数:5
相关论文
共 20 条
  • [1] [Anonymous], 2013, INT C MACHINE LEARNI
  • [2] Cao Chuan, 2008, 9 ANN C INT SPEECH C
  • [3] Chuan Cao, 2008, 2008 9th International Conference on Signal Processing (ICSP 2008), P1475, DOI 10.1109/ICOSP.2008.4697411
  • [4] Dieleman Sander, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P6964, DOI 10.1109/ICASSP.2014.6854950
  • [5] Garnier Maeva, 2005, J INTERDISCIPLINARY, V1, P62
  • [6] Gomez Emilia, 2000, ARXIV180703046
  • [7] Hershey S, 2017, INT CONF ACOUST SPEE, P131, DOI 10.1109/ICASSP.2017.7952132
  • [8] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
  • [9] Myers Raymond H, 1974, ELEMENTARY APPL STAT
  • [10] NAKANO T, 2006, 9 INT C SPOK LANG PR