Two-stage streaming keyword detection and localization with multi-scale depthwise temporal convolution

被引:0
作者
Hou, Jingyong [1 ]
Xie, Lei [1 ]
Zhang, Shilei [2 ]
机构
[1] Audio, Speech and Language Processing Group (ASLP@NPU), ASGO, School of Computer Science, Northwestern Polytechnical University, Xi'an, China
[2] China Mobile Research Institute, China
关键词
74;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
页码:28 / 42
相关论文
共 50 条
[21]   Gated Multi-Scale Transformer for Temporal Action Localization [J].
Yang, Jin ;
Wei, Ping ;
Ren, Ziyang ;
Zheng, Nanning .
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 :5705-5717
[22]   Two-Stage Multi-Scale Fault Diagnosis Method for Rolling Bearings with Imbalanced Data [J].
Zheng, Minglei ;
Chang, Qi ;
Man, Junfeng ;
Liu, Yi ;
Shen, Yiping .
MACHINES, 2022, 10 (05)
[23]   Tsnet: a two-stage network for image dehazing with multi-scale fusion and adaptive learning [J].
Gong, Xiaolin ;
Zheng, Zehan ;
Du, Heyuan .
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (10) :7119-7130
[24]   Neural Image Compression with Multi-scale Depthwise Separable Dilated Convolution and Multi-distribution Mixture Entropy Model [J].
Yang, Dongjian ;
Fan, Xiaopeng ;
Meng, Xiandong ;
Zhao, Debin .
2025 DATA COMPRESSION CONFERENCE, DCC, 2025, :411-411
[25]   Multi-scale convolution target detection algorithm with feature pyramid [J].
Lin Z.-J. ;
Luo Z. ;
Zhao L. ;
Lu D.-M. .
Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2019, 53 (03) :533-540
[26]   Two-stage point cloud registration using multi-scale edge convolution for digital twin-based bridge construction progress monitoring [J].
Zhang, Hao ;
Yan, Junwei ;
Yang, Jun ;
Meng, Wei ;
Chen, Shujie .
AUTOMATION IN CONSTRUCTION, 2025, 178
[27]   RDM2: a two-stage model based on residual learning diffusion model and multi-scale convolution for Low Dose CT denoisingRDM2: a two-stage model based on residual learning diffusion model and multi-scale convolution for Low Dose CT denoisingZ. Jiang et al. [J].
Zhencun Jiang ;
Kangrui Ren ;
Kefan Wang ;
Zhongjie Wang .
Applied Intelligence, 55 (13)
[28]   MSSD: multi-scale object detector based on spatial pyramid depthwise convolution and efficient channel attention mechanism [J].
Zhou, Yipeng ;
Qian, Huaming ;
Ding, Peng .
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2023, 20 (05)
[29]   MSSD: multi-scale object detector based on spatial pyramid depthwise convolution and efficient channel attention mechanism [J].
Yipeng Zhou ;
Huaming Qian ;
Peng Ding .
Journal of Real-Time Image Processing, 2023, 20
[30]   MDWConv:CNN based on multi-scale atrous pyramid and depthwise separable convolution for long time series forecasting [J].
Tian, Guangpo ;
Xu, Yunyang ;
Ma, Xiang ;
Li, Xuemei ;
Zhang, Caiming .
NEURAL NETWORKS, 2025, 185