A Multi-Scale Fully Convolutional Network for Singing Melody Extraction

Authors
Gao, Ping [1]
You, Cheng-You [1]
Chi, Tai-Shih [1]
Affiliations
[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu 300, Taiwan
DOI
10.1109/apsipaasc47483.2019.9023231
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Melody extraction can be framed as either a sequence-to-sequence task or a classification task. Many recent models based on semantic segmentation have proven very effective for melody extraction. In this paper, we built a fully convolutional network (FCN) for melody extraction from polyphonic music. Inspired by state-of-the-art semantic segmentation architectures, we constructed the encoder in a dense manner and designed the decoder accordingly for audio processing. The combined frequency and periodicity (CFP) representation, which contains both spectral and cepstral information, was adopted as the input feature of the proposed model. We compared the performance of the proposed model with that of several other methods on various datasets. Experimental results show that the proposed model achieves state-of-the-art performance with less computation and fewer parameters.
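The abstract notes that the CFP input combines spectral and cepstral information. The following is a minimal sketch of that general idea, not the paper's actual feature pipeline: it scores each candidate pitch by multiplying the power spectrum at that frequency with the real cepstrum at the matching period. The function names (`dft`, `spectrum_and_cepstrum`, `harmonic_frame`, `estimate_f0`), the synthetic test tone, and all parameter values are assumptions made for illustration.

```python
import cmath
import math


def dft(x):
    """Naive O(n^2) discrete Fourier transform (adequate for one short frame)."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def spectrum_and_cepstrum(frame):
    """Return the power spectrum and the real cepstrum of a real-valued frame."""
    n = len(frame)
    spec = [abs(c) ** 2 for c in dft(frame)]
    log_spec = [math.log(s + 1e-9) for s in spec]
    # The log spectrum of a real frame is real and symmetric, so its inverse
    # DFT has the same real part as its forward DFT divided by n.
    ceps = [c.real / n for c in dft(log_spec)]
    return spec, ceps


def harmonic_frame(f0, fs, n, amps):
    """Hypothetical test signal: a sum of harmonics of f0 with given amplitudes."""
    return [
        sum(a * math.sin(2 * math.pi * (h + 1) * f0 * t / fs)
            for h, a in enumerate(amps))
        for t in range(n)
    ]


def estimate_f0(frame, fs, fmin, fmax):
    """Pick the frequency bin whose spectral and cepstral evidence agree best."""
    n = len(frame)
    spec, ceps = spectrum_and_cepstrum(frame)
    best_f, best_score = None, float("-inf")
    for k in range(1, n // 2):
        f = k * fs / n                      # frequency of bin k in Hz
        if not fmin <= f <= fmax:
            continue
        q = round(fs / f)                   # matching period (quefrency) in samples
        if q >= n // 2:
            continue
        # Spectral peaks occur at f0 and its harmonics, cepstral peaks at the
        # period and its multiples; their product emphasizes f0 itself.
        score = spec[k] * max(ceps[q], 0.0)
        if score > best_score:
            best_f, best_score = f, score
    return best_f


fs, n = 8000, 256
frame = harmonic_frame(250.0, fs, n, [0.5, 1.0, 0.4, 0.3])
print(estimate_f0(frame, fs, fmin=80.0, fmax=1000.0))
```

In this synthetic tone the second harmonic is the strongest component, so picking the largest spectral peak alone would land on 500 Hz; weighting by the cepstrum is intended to pull the estimate back to the 250 Hz fundamental. A real CFP front end works frame by frame on a time-frequency representation and feeds the result to the network, which this sketch does not attempt.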
Pages: 1288-1293
Number of pages: 6