23 entries in total
- [2] BYTECOVER2: TOWARDS DIMENSIONALITY REDUCTION OF LATENT EMBEDDING FOR EFFICIENT COVER SONG IDENTIFICATION [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022: 616-620
- [3] BYTECOVER: COVER SONG IDENTIFICATION VIA MULTI-LOSS TRAINING [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 551-555
- [4] Ellis DPW, 2007, INT CONF ACOUST SPEE, P1429
- [5] Conformer: Convolution-augmented Transformer for Speech Recognition [J]. INTERSPEECH 2020, 2020: 5036-5040
- [6] Guo RQ, 2020, PR MACH LEARN RES, V119
- [7] Hu S., 2022, Interspeech
- [8] Focal Loss for Dense Object Detection [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 2999-3007
- [9] Bag of Tricks and A Strong Baseline for Deep Person Re-identification [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019: 1487-1495
- [10] Marolt M., 2006, ISMIR