Dummy Prototypical Networks for Few-Shot Open-Set Keyword Spotting

被引:5
作者
Kim, Byeonggeun [1 ]
Yang, Seunghan [1 ]
Chung, Inseop [1 ,2 ]
Chang, Simyung [1 ]
机构
[1] Qualcomm Korea YH, Qualcomm AI Res, Seoul, South Korea
[2] Seoul Natl Univ, Seoul, South Korea
来源
INTERSPEECH 2022 | 2022年
关键词
Few-shot learning; Open-set Recognition; Keyword Spotting; Dummy Prototype; Prototypical Networks;
D O I
10.21437/Interspeech.2022-921
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Keyword spotting is the task of detecting a keyword in streaming audio. Conventional keyword spotting targets predefined keywords classification, but there is growing attention in few-shot (query-by-example) keyword spotting, e.g., N-way classification given M-shot support samples. Moreover, in real-world scenarios, there can be utterances from unexpected categories (open-set) which need to be rejected rather than classified as one of the N classes. Combining the two needs, we tackle few-shot open-set keyword spotting with a new benchmark setting, named splitGSC. We propose episode-known dummy prototypes based on metric learning to detect an open-set better and introduce a simple and powerful approach, Dummy Prototypical Networks (D-ProtoNets). Our D-ProtoNets shows clear margins compared to recent few-shot open-set recognition (FSOSR) approaches in the suggested splitGSC. We also verify our method on a standard benchmark, miniImageNet, and D-ProtoNets shows the state-of-the-art open-set detection rate in FSOSR.
引用
收藏
页码:4621 / 4625
页数:5
相关论文
共 50 条
  • [1] Few-Shot Keyword Spotting With Prototypical Networks
    Parnami, Archit
    Lee, Minwoo
    PROCEEDINGS OF 2022 7TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES, ICMLT 2022, 2022, : 277 - 283
  • [2] TASK-AGNOSTIC OPEN-SET PROTOTYPE FOR FEW-SHOT OPEN-SET RECOGNITION
    Kim, Byeonggeun
    Lee, Jun-Tae
    Shim, Kyuhong
    Chang, Simyung
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 31 - 35
  • [3] Few-shot Open-set Recognition Using Background as Unknowns
    Song, Nan
    Zhang, Chi
    Lin, Guosheng
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5970 - 5979
  • [4] Overall positive prototype for few-shot open-set recognition
    Sun, Liang-Yu
    Chu, Wei-Ta
    PATTERN RECOGNITION, 2024, 151
  • [5] Few-Shot Keyword Spotting in Any Language
    Mazumder, Mark
    Banbury, Colby
    Meyer, Josh
    Warden, Pete
    Reddi, Vijay Janapa
    INTERSPEECH 2021, 2021, : 4214 - 4218
  • [6] Few-shot open-set recognition via pairwise discriminant aggregation
    Jin, Jian
    Shen, Yang
    Fu, Zhenyong
    Yang, Jian
    NEUROCOMPUTING, 2024, 602
  • [7] Learning Relative Feature Displacement for Few-Shot Open-Set Recognition
    Deng, Shule
    Yu, Jin-Gang
    Wu, Zihao
    Gao, Hongxia
    Li, Yansheng
    Yang, Yang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5763 - 5774
  • [8] Feature-semantic augmentation network for few-shot open-set recognition
    Huang, Xilang
    Choi, Seon Han
    PATTERN RECOGNITION, 2024, 156
  • [9] Towards Open-Set APT Malware Classification under Few-Shot Setting
    Bao, Huaifeng
    Wang, Wen
    Liu, Feng
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6844 - 6849
  • [10] TNPNet: An approach to Few-shot open-set recognition via contextual transductive learning
    Wu, Shaoling
    Luo, Huilan
    Lin, Xiaoming
    NEUROCOMPUTING, 2025, 621