An Anchor-Free Detector for Continuous Speech Keyword Spotting

被引:0
|
作者
Zhao, Zhiyuan [1 ]
Tang, Chuanxin [1 ]
Yao, Chengdong [2 ]
Luo, Chong [1 ]
机构
[1] Microsoft Res Asia, Beijing, Peoples R China
[2] Univ Technol Sydney, Sydney, NSW, Australia
来源
关键词
keyword spotting; continuous speech keyword spotting; speech recognition; anchor-free detector; open dataset;
D O I
10.21437/Interspeech.2022-296
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Continuous Speech Keyword Spotting (CSKWS) is a task to detect predefined keywords in a continuous speech. In this paper, we regard CSKWS as a one-dimensional object detection task and propose a novel anchor-free detector, named AF-KWS, to solve the problem. AF-KWS directly regresses the center locations and lengths of the keywords through a single-stage deep neural network. In particular, AF-KWS is tailored for this speech task as we introduce an auxiliary unknown class to exclude other words from non-speech or silent background. We have built two benchmark datasets named LibriTop-20 and continuous meeting analysis keywords (CMAK) dataset for CSKWS. Evaluations on these two datasets show that our proposed AF-KWS outperforms reference schemes by a large margin, and therefore provides a decent baseline for future research.
引用
收藏
页码:3228 / 3232
页数:5
相关论文
共 50 条
  • [11] Phoneme based acoustics keyword spotting in informal continuous speech
    Szöke, I
    Schwarz, P
    Matejka, P
    Burget, L
    Karafiát, M
    Cemocky, J
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2005, 3658 : 302 - 309
  • [12] Keyword spotting in continuous speech using convolutional neural network
    Rostami, Amir Mohammad
    Karimi, Ali
    Akhaee, Mohammad Ali
    Speech Communication, 2022, 142 : 15 - 21
  • [13] An anchor-free object detector with novel corner matching method
    Ma, Tingsong
    Tian, Wenhong
    Kuang, Ping
    Xie, Yuanlun
    KNOWLEDGE-BASED SYSTEMS, 2021, 224
  • [14] FlashNet: A Real-time Anchor-Free Face Detector
    Ge, Yongtao
    Wang, Qiang
    Sheng, Biyun
    Yang, Wankou
    2020 35TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2020, : 441 - 446
  • [15] MAOD: An Efficient Anchor-Free Object Detector Based on MobileDet
    Chen, Dong
    Shen, Hao
    IEEE ACCESS, 2020, 8 : 86564 - 86572
  • [16] ALODAD: An Anchor-Free Lightweight Object Detector for Autonomous Driving
    Liang, Tianjiao
    Bao, Hong
    Pan, Weiguo
    Pan, Feng
    IEEE ACCESS, 2022, 10 : 40701 - 40714
  • [17] Keyword Spotting in Continuous Speech Using Spectral and Prosodic Information Fusion
    Laxmi Pandey
    Rajesh M. Hegde
    Circuits, Systems, and Signal Processing, 2019, 38 : 2767 - 2791
  • [18] Keyword Spotting in Continuous Speech Using Spectral and Prosodic Information Fusion
    Pandey, Laxmi
    Hegde, Rajesh M.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (06) : 2767 - 2791
  • [19] FCOSR: A Simple Anchor-Free Rotated Detector for Aerial Object Detection
    Li, Zhonghua
    Hou, Biao
    Wu, Zitong
    Ren, Bo
    Yang, Chen
    REMOTE SENSING, 2023, 15 (23)
  • [20] AN ORIENTATION-AWARE ANCHOR-FREE DETECTOR FOR AERIAL OBJECT DETECTION
    Duan, Mudi
    Meng, Ran
    Xiao, Liang
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3075 - 3078