An Anchor-Free Detector for Continuous Speech Keyword Spotting

被引:0
|
作者
Zhao, Zhiyuan [1 ]
Tang, Chuanxin [1 ]
Yao, Chengdong [2 ]
Luo, Chong [1 ]
机构
[1] Microsoft Res Asia, Beijing, Peoples R China
[2] Univ Technol Sydney, Sydney, NSW, Australia
来源
关键词
keyword spotting; continuous speech keyword spotting; speech recognition; anchor-free detector; open dataset;
D O I
10.21437/Interspeech.2022-296
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Continuous Speech Keyword Spotting (CSKWS) is a task to detect predefined keywords in a continuous speech. In this paper, we regard CSKWS as a one-dimensional object detection task and propose a novel anchor-free detector, named AF-KWS, to solve the problem. AF-KWS directly regresses the center locations and lengths of the keywords through a single-stage deep neural network. In particular, AF-KWS is tailored for this speech task as we introduce an auxiliary unknown class to exclude other words from non-speech or silent background. We have built two benchmark datasets named LibriTop-20 and continuous meeting analysis keywords (CMAK) dataset for CSKWS. Evaluations on these two datasets show that our proposed AF-KWS outperforms reference schemes by a large margin, and therefore provides a decent baseline for future research.
引用
收藏
页码:3228 / 3232
页数:5
相关论文
共 50 条
  • [1] A fully convolutional anchor-free object detector
    Taoshan Zhang
    Zheng Li
    Zhikuan Sun
    Lin Zhu
    The Visual Computer, 2023, 39 : 569 - 580
  • [2] An Anchor-Free Detector to Detect Small Objects
    Hu, Tingting
    Zhang, Tao
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [3] TomatoDet: Anchor-free detector for tomato detection
    Liu, Guoxu
    Hou, Zengtian
    Liu, Hongtao
    Liu, Jun
    Zhao, Wenjie
    Li, Kun
    FRONTIERS IN PLANT SCIENCE, 2022, 13
  • [4] A fully convolutional anchor-free object detector
    Zhang, Taoshan
    Li, Zheng
    Sun, Zhikuan
    Zhu, Lin
    VISUAL COMPUTER, 2023, 39 (02): : 569 - 580
  • [5] FCOS: A Simple and Strong Anchor-Free Object Detector
    Tian, Zhi
    Shen, Chunhua
    Chen, Hao
    He, Tong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (04) : 1922 - 1933
  • [6] Fashion Image Search via Anchor-Free Detector
    Gao, Shanchuan
    Zeng, Fankai
    Cheng, Lu
    Fan, Jicong
    Zhao, Mingbo
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 416 - 425
  • [7] An Efficient Anchor-Free Face Detector with Attention Mechanisms
    Zhu, Xiangxian
    Lou, Yilun
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [8] An Efficient Anchor-Free Face Detector with Attention Mechanisms
    Zhu, Xiangxian
    Lou, Yilun
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [9] ElDet: An Anchor-Free General Ellipse Object Detector
    Wang, Tianhao
    Lu, Changsheng
    Shao, Ming
    Yuan, Xiaohui
    Xia, Siyu
    COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 223 - 238
  • [10] Keyword spotting in continuous speech using convolutional neural network
    Rostami, Amir Mohammad
    Karimi, Ali
    Akhaee, Mohammad Ali
    SPEECH COMMUNICATION, 2022, 142 : 15 - 21