Estimating Rainfall from Surveillance Audio Based on Parallel Network with Multi-Scale Fusion and Attention Mechanism

被引:6
|
作者
Chen, Mingzheng [1 ,2 ,3 ]
Wang, Xing [1 ,2 ,3 ,4 ]
Wang, Meizhen [1 ,2 ,3 ]
Liu, Xuejun [1 ,2 ,3 ]
Wu, Yong [5 ]
Wang, Xiaochu [1 ,2 ,3 ]
机构
[1] Nanjing Normal Univ, Minist Educ, Key Lab Virtual Geog Environm, Nanjing 210023, Peoples R China
[2] State Key Lab Cultivat Base Geog Environm Evolut, Nanjing 210023, Peoples R China
[3] Jiangsu Ctr Collaborat Innovat Geog Informat Reso, Nanjing 210023, Peoples R China
[4] Univ Vienna, Dept Geog & Reg Res, A-1010 Vienna, Austria
[5] Fujian Normal Univ, Inst Geog, Fuzhou 350000, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
rainfall estimation; surveillance audio; machine learning; multi-scale fusion; CLASSIFICATION; RECOGNITION; RESOLUTION;
D O I
10.3390/rs14225750
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Rainfall data have a profound significance for meteorology, climatology, hydrology, and environmental sciences. However, existing rainfall observation methods (including ground-based rain gauges and radar-/satellite-based remote sensing) are not efficient in terms of spatiotemporal resolution and cannot meet the needs of high-resolution application scenarios (urban waterlogging, emergency rescue, etc.). Widespread surveillance cameras have been regarded as alternative rain gauges in existing studies. Surveillance audio, through exploiting their nonstop use to record rainfall acoustic signals, should be considered a type of data source to obtain high-resolution and all-weather data. In this study, a method named parallel neural network based on attention mechanisms and multi-scale fusion (PNNAMMS) is proposed for automatically classifying rainfall levels by surveillance audio. The proposed model employs a parallel dual-channel network with spatial channel extracting the frequency domain correlation, and temporal channel capturing the time-domain continuity of the rainfall sound. Additionally, attention mechanisms are used on the two channels to obtain significant spatiotemporal elements. A multi-scale fusion method was adopted to fuse different scale features in the spatial channel for more robust performance in complex surveillance scenarios. In experiments showed that our method achieved an estimation accuracy of 84.64% for rainfall levels and outperformed previously proposed methods.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Attention-Guided Multi-Scale Fusion Network for Similar Objects Semantic Segmentation
    Fengqin Yao
    Shengke Wang
    Laihui Ding
    Guoqiang Zhong
    Shu Li
    Zhiwei Xu
    Cognitive Computation, 2024, 16 : 366 - 376
  • [22] Attention-Guided Multi-Scale Fusion Network for Similar Objects Semantic Segmentation
    Yao, Fengqin
    Wang, Shengke
    Ding, Laihui
    Zhong, Guoqiang
    Li, Shu
    Xu, Zhiwei
    COGNITIVE COMPUTATION, 2024, 16 (01) : 366 - 376
  • [23] A novel sleep staging network based on multi-scale dual attention
    Wang, Huafeng
    Lu, Chonggang
    Zhang, Qi
    Hu, Zhimin
    Yuan, Xiaodong
    Zhang, Pingshu
    Liu, Wanquan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 74
  • [24] Multi-scale generative adversarial inpainting network based on cross-layer attention transfer mechanism
    Shao, Mingwen
    Zhang, Wentao
    Zuo, Wangmeng
    Meng, Deyu
    KNOWLEDGE-BASED SYSTEMS, 2020, 196 (196)
  • [25] Semantic Segmentation of Urban Airborne LiDAR Point Clouds Based on Fusion Attention Mechanism and Multi-Scale Features
    Wang, Jingxue
    Li, Huan
    Xu, Zhenghui
    Xie, Xiao
    REMOTE SENSING, 2023, 15 (21)
  • [26] AMFF-Net: An attention-based multi-scale feature fusion network for allergic pollen detection
    Li, Jianqiang
    Wang, Quanzeng
    Xiong, Chengyao
    Zhao, Linna
    Cheng, Wenxiu
    Xu, Xi
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
  • [27] AGGN: Attention-based glioma grading network with multi-scale feature extraction and multi-modal information fusion
    Wu, Peishu
    Wang, Zidong
    Zheng, Baixun
    Li, Han
    Alsaadi, Fuad E.
    Zeng, Nianyin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 152
  • [28] Multi-Scale Frequency-Spatial Domain Attention Fusion Network for Building Extraction in Remote Sensing Images
    Liu, Jia
    Chen, Hao
    Li, Zuhe
    Gu, Hang
    ELECTRONICS, 2024, 13 (23):
  • [29] Detection and Localization of Myocardial Infarction Based on Multi-Scale ResNet and Attention Mechanism
    Cao, Yang
    Liu, Wenyan
    Zhang, Shuang
    Xu, Lisheng
    Zhu, Baofeng
    Cui, Huiying
    Geng, Ning
    Han, Hongguang
    Greenwald, Stephen E.
    FRONTIERS IN PHYSIOLOGY, 2022, 13
  • [30] Multi-scale ResNet and BiGRU automatic sleep staging based on attention mechanism
    Liu, Changyuan
    Yin, Yunfu
    Sun, Yuhan
    Ersoy, Okan K.
    PLOS ONE, 2022, 17 (06):