An Anchor-Free Detector for Continuous Speech Keyword Spotting

被引:0
|
作者
Zhao, Zhiyuan [1 ]
Tang, Chuanxin [1 ]
Yao, Chengdong [2 ]
Luo, Chong [1 ]
机构
[1] Microsoft Res Asia, Beijing, Peoples R China
[2] Univ Technol Sydney, Sydney, NSW, Australia
来源
关键词
keyword spotting; continuous speech keyword spotting; speech recognition; anchor-free detector; open dataset;
D O I
10.21437/Interspeech.2022-296
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Continuous Speech Keyword Spotting (CSKWS) is a task to detect predefined keywords in a continuous speech. In this paper, we regard CSKWS as a one-dimensional object detection task and propose a novel anchor-free detector, named AF-KWS, to solve the problem. AF-KWS directly regresses the center locations and lengths of the keywords through a single-stage deep neural network. In particular, AF-KWS is tailored for this speech task as we introduce an auxiliary unknown class to exclude other words from non-speech or silent background. We have built two benchmark datasets named LibriTop-20 and continuous meeting analysis keywords (CMAK) dataset for CSKWS. Evaluations on these two datasets show that our proposed AF-KWS outperforms reference schemes by a large margin, and therefore provides a decent baseline for future research.
引用
收藏
页码:3228 / 3232
页数:5
相关论文
共 50 条
  • [31] EFR-FCOS: enhancing feature reuse for anchor-free object detector
    Liao, Yongwei
    Li, Zhenjun
    Feng, Wenlong
    Zhang, Yibin
    Zhou, Bing
    PEERJ, 2024, 10 : 1 - 23
  • [32] Pneumonia detection based on RSNA dataset and anchor-free deep learning detector
    Wu, Linghua
    Zhang, Jing
    Wang, Yilin
    Ding, Rong
    Cao, Yueqin
    Liu, Guiqin
    Liufu, Changsheng
    Xie, Baowei
    Kang, Shanping
    Liu, Rui
    Li, Wenle
    Guan, Furen
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [33] An Efficient Anchor-Free Defect Detector With Dynamic Receptive Field and Task Alignment
    Zuo, Fengyuan
    Liu, Jinhai
    Fu, Mingrui
    Wang, Lei
    Zhao, Zhen
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (06) : 8536 - 8547
  • [34] 3D Anchor-Free Lesion Detector on Computed Tomography Scans
    Zhang, Ning
    Wang, Dechun
    Sun, Xinzi
    Zhang, Pengfei
    Zhang, Chenxi
    Cao, Yu
    Liu, Benyuan
    2019 FIRST INTERNATIONAL CONFERENCE ON TRANSDISCIPLINARY AI (TRANSAI 2019), 2019, : 48 - 51
  • [35] HAFREE: A Heatmap-Based Anchor-Free Detector for Apple Defect Detection
    Bui Ngoc Han, Nguyen
    Lee, Ju-Hwan
    Thanh Vu, Dang
    Murtza, Iqbal
    Kim, Hyoung-Gook
    Kim, Jin-Young
    IEEE ACCESS, 2024, 12 : 182799 - 182813
  • [36] AccLoc: Anchor-Free and two-stage detector for accurate object localization
    Piao, Zhengquan
    Wang, Junbo
    Tang, Linbo
    Zhao, Baojun
    Wang, Wenzheng
    PATTERN RECOGNITION, 2022, 126
  • [37] CPS-Det: An Anchor-Free Based Rotation Detector for Ship Detection
    Yang, Yi
    Pan, Zongxu
    Hu, Yuxin
    Ding, Chibiao
    REMOTE SENSING, 2021, 13 (11)
  • [38] A Context-Aware Anchor-free Tiny Object Detector for Aerial Images
    Chen, Li-Syuan
    Way, Der-Lor
    Shih, Zen-Chung
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2022, 2022, 12177
  • [39] Pneumonia detection based on RSNA dataset and anchor-free deep learning detector
    Linghua Wu
    Jing Zhang
    Yilin Wang
    Rong Ding
    Yueqin Cao
    Guiqin Liu
    Changsheng Liufu
    Baowei Xie
    Shanping Kang
    Rui Liu
    Wenle Li
    Furen Guan
    Scientific Reports, 14
  • [40] Anchor point detection for continuous speech recognition in Spanish: The spotting of phonetic events
    Leandro, MA
    Pardo, JM
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2336 - 2339