45 entries in total
[31] Melody Extraction From Polyphonic Music Signals Using Pitch Contour Characteristics [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20(06): 1759-1770
[32] wav2vec: Unsupervised Pre-training for Speech Recognition [J]. INTERSPEECH 2019, 2019: 3465-3469
[33] Seetharaman P., 2020, PROC WORKSHOP SELF S
[34] Seetharaman P, 2017, IEEE WORK APPL SIG, P36, DOI 10.1109/WASPAA.2017.8169990
[35] EXPLORING WAVLM ON SPEECH ENHANCEMENT [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022: 451-457
[36] Stoller D., 2018, ARXIV PREPRINT ARXIV, DOI 10.5281/ZENODO.1492417
[37] Stoter F.R., 2019, J. Open Source Softw., V4, P1667
[38] The 2018 Signal Separation Evaluation Campaign [J]. LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891: 293-305
[39] Utilizing Self-supervised Representations for MOS Prediction [J]. INTERSPEECH 2021, 2021: 2781-2785
[40] Turian J., 2022, P INT C NEUR INF PRO, P125