Singing voice detection for karaoke application

被引:2
作者
Shenoy, A [1 ]
Wu, YS [1 ]
Wang, Y [1 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore 117543, Singapore
来源
Visual Communications and Image Processing 2005, Pts 1-4 | 2005年 / 5960卷
关键词
karaoke; singing voice; vocal segmentation; tonic; key; inverse comb filtering; rhythm; lyrics;
D O I
10.1117/12.631645
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
We present a framework to detect the regions of singing voice in musical audio signals. This work is oriented towards the development of a robust transcriber of lyrics for karaoke applications. The technique leverages on a combination of low-level audio features and higher level musical knowledge of rhythm and tonality. Musical knowledge of the key is used to create a song-specific filterbank to attenuate the presence of the pitched musical instruments. This is followed by subband processing of the audio to detect the musical octaves in which the vocals are present. Text processing is employed to approximate the duration of the sung passages using freely available lyrics. This is used to obtain a dynamic threshold for vocal/ non-vocal segmentation. This pairing of audio and text processing helps create a more accurate system. Experimental evaluation on a small database of popular songs shows the validity of the proposed approach. Holistic and per-component evaluation of the system is conducted and various improvements are discussed.
引用
收藏
页码:752 / 762
页数:11
相关论文
共 50 条
  • [21] Estimation of singing voice types based on voice parameters analysis
    Polrolniczak, Edward
    Kramarczyk, Michal
    [J]. 2017 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA 2017), 2017, : 63 - 68
  • [22] Qualitative and Quantitative Measurement of the Singing Voice
    Cesari, U.
    Iengo, M.
    Apisa, P.
    [J]. FOLIA PHONIATRICA ET LOGOPAEDICA, 2012, 64 (06) : 304 - 309
  • [23] DEVELOPMENT OF CHILDREN'S SINGING VOICE
    Asztalos, Andrea
    [J]. STUDIA UNIVERSITATIS BABES-BOLYAI MUSICA, 2021, 66 (01): : 39 - 54
  • [24] The Effects of Stress on Singing Voice Accuracy
    Larrouy-Maestri, Pauline
    Morsomme, Dominique
    [J]. JOURNAL OF VOICE, 2014, 28 (01) : 52 - 58
  • [25] Psychosomatic aspects in the treatment of the singing voice
    Spahn, C.
    Voltmer, E.
    [J]. HNO, 2011, 59 (06) : 563 - 567
  • [26] Demystifying trans* plus voice education: The Transgender Singing Voice Conference
    Cayari, Christopher
    [J]. INTERNATIONAL JOURNAL OF MUSIC EDUCATION, 2019, 37 (01) : 118 - 131
  • [27] Voice Timbre Control Based on Perceived Age in Singing Voice Conversion
    Kobayashi, Kazuhiro
    Toda, Tomoki
    Doi, Hironori
    Nakano, Tomoyasu
    Goto, Masataka
    Neubig, Graham
    Sakti, Sakriani
    Nakamura, Satoshi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (06): : 1419 - 1428
  • [28] Validierung des Singing Voice Handicap Index in der deutschen FassungValidation of the German version of the Singing Voice Handicap Index
    A. Lorenz
    B. Kleber
    M. Büttner
    M. Fuchs
    D. Mürbe
    B. Richter
    M. Sandel
    T. Nawka
    [J]. HNO, 2013, 61 (8) : 699 - 706
  • [29] Establishing Normative Data on Singing Voice Parameters of Children and Adolescents with Average Singing Activity Using the Voice Range Profile
    Dienerowitz, Tobias
    Peschel, Thomas
    Vogel, Mandy
    Poulain, Tanja
    Engel, Christoph
    Kiess, Wieland
    Fuchs, Michael
    Berger, Thomas
    [J]. FOLIA PHONIATRICA ET LOGOPAEDICA, 2021, 73 (06) : 565 - 576
  • [30] CPPS and Voice-Source Parameters: Objective Analysis of the Singing Voice
    Baker, Calvin P.
    Sundberg, Johan
    Purdy, Suzanne C.
    Rakena, Te Oti
    Leao, Sylvia H. de S.
    [J]. JOURNAL OF VOICE, 2024, 38 (03) : 549 - 560