Dummy Prototypical Networks for Few-Shot Open-Set Keyword Spotting

被引：5

作者：

Kim, Byeonggeun ^{[1
]}

Yang, Seunghan ^{[1
]}

Chung, Inseop ^{[1
,2
]}

Chang, Simyung ^{[1
]}

机构：

[1] Qualcomm Korea YH, Qualcomm AI Res, Seoul, South Korea

[2] Seoul Natl Univ, Seoul, South Korea

来源：

INTERSPEECH 2022 | 2022年

关键词：

Few-shot learning; Open-set Recognition; Keyword Spotting; Dummy Prototype; Prototypical Networks;

D O I：

10.21437/Interspeech.2022-921

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Keyword spotting is the task of detecting a keyword in streaming audio. Conventional keyword spotting targets predefined keywords classification, but there is growing attention in few-shot (query-by-example) keyword spotting, e.g., N-way classification given M-shot support samples. Moreover, in real-world scenarios, there can be utterances from unexpected categories (open-set) which need to be rejected rather than classified as one of the N classes. Combining the two needs, we tackle few-shot open-set keyword spotting with a new benchmark setting, named splitGSC. We propose episode-known dummy prototypes based on metric learning to detect an open-set better and introduce a simple and powerful approach, Dummy Prototypical Networks (D-ProtoNets). Our D-ProtoNets shows clear margins compared to recent few-shot open-set recognition (FSOSR) approaches in the suggested splitGSC. We also verify our method on a standard benchmark, miniImageNet, and D-ProtoNets shows the state-of-the-art open-set detection rate in FSOSR.

引用

页码：4621 / 4625

页数：5

共 50 条

[1] Few-Shot Keyword Spotting With Prototypical Networks
Parnami, Archit
Lee, Minwoo
PROCEEDINGS OF 2022 7TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES, ICMLT 2022, 2022, : 277 - 283
[2] TASK-AGNOSTIC OPEN-SET PROTOTYPE FOR FEW-SHOT OPEN-SET RECOGNITION
Kim, Byeonggeun
Lee, Jun-Tae
Shim, Kyuhong
Chang, Simyung
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 31 - 35
[3] Few-shot Open-set Recognition Using Background as Unknowns
Song, Nan
Zhang, Chi
Lin, Guosheng
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5970 - 5979
[4] Overall positive prototype for few-shot open-set recognition
Sun, Liang-Yu
Chu, Wei-Ta
PATTERN RECOGNITION, 2024, 151
[5] Few-Shot Keyword Spotting in Any Language
Mazumder, Mark
Banbury, Colby
Meyer, Josh
Warden, Pete
Reddi, Vijay Janapa
INTERSPEECH 2021, 2021, : 4214 - 4218
[6] Few-shot open-set recognition via pairwise discriminant aggregation
Jin, Jian
Shen, Yang
Fu, Zhenyong
Yang, Jian
NEUROCOMPUTING, 2024, 602
[7] Learning Relative Feature Displacement for Few-Shot Open-Set Recognition
Deng, Shule
Yu, Jin-Gang
Wu, Zihao
Gao, Hongxia
Li, Yansheng
Yang, Yang
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 5763 - 5774
[8] Feature-semantic augmentation network for few-shot open-set recognition
Huang, Xilang
Choi, Seon Han
PATTERN RECOGNITION, 2024, 156
[9] Towards Open-Set APT Malware Classification under Few-Shot Setting
Bao, Huaifeng
Wang, Wen
Liu, Feng
IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6844 - 6849
[10] TNPNet: An approach to Few-shot open-set recognition via contextual transductive learning
Wu, Shaoling
Luo, Huilan
Lin, Xiaoming
NEUROCOMPUTING, 2025, 621

← 1 2 3 4 5 →