A Multi-Scale Fully Convolutional Network for Singing Melody Extraction

Cited by: 0
Authors
Gao, Ping [1]
You, Cheng-You [1]
Chi, Tai-Shih [1]
Affiliations
[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu 300, Taiwan
Keywords
DOI
10.1109/apsipaasc47483.2019.9023231
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
Melody extraction can be treated as either a sequence-to-sequence task or a classification task. Many recent models based on semantic segmentation have proven very effective for melody extraction. In this paper, we built a fully convolutional network (FCN) for melody extraction from polyphonic music. Inspired by state-of-the-art semantic segmentation architectures, we constructed the encoder in a dense way and designed the decoder accordingly for audio processing. The combined frequency and periodicity (CFP) representation, which contains spectral and cepstral information, was adopted as the input feature of the proposed model. We compared the performance of the proposed model with that of several other methods on various datasets. Experimental results show that the proposed model achieves state-of-the-art performance with less computation and fewer parameters.
Pages: 1288-1293
Page count: 6