Deep Learning for HABs Prediction with Multimodal Fusion

Cited by: 1
Authors
Zhao, Fei [1]
Zhang, Chengcui [1]
Affiliations
[1] Univ Alabama Birmingham, Birmingham, AL 35294 USA
Source
31ST ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS, ACM SIGSPATIAL GIS 2023 | 2023
Keywords
Geolocation; Computer Vision; Deep Learning; Harmful Algal Blooms;
DOI
10.1145/3589132.3628370
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Harmful Algal Blooms (HABs) present significant environmental and public health threats. Recent machine learning-based HABs monitoring methods often rely solely on unimodal data, e.g., satellite imagery, overlooking crucial environmental factors such as temperature. Moreover, existing multi-modal approaches grapple with real-time applicability and generalizability challenges due to the use of ensemble methodologies and hard-coded geolocation clusters. Addressing these gaps, this paper presents a novel deep learning model using a single-model-based multi-task framework. This framework is designed to segment water bodies and predict HABs severity levels concurrently, enabling the model to focus on areas of interest, thereby enhancing prediction accuracy. Our model integrates multimodal inputs, i.e., satellite imagery, elevation data, temperature readings, and geolocation details, via a dual-branch architecture: the Satellite-Elevation (SE) branch and the Temperature-Geolocation (TG) branch. Satellite and elevation data in the SE branch, being spatially coherent, assist in water area detection and feature extraction. Meanwhile, the TG branch, using sequential temperature data and geolocation information, captures temporal algal growth patterns and adjusts for temperature variations influenced by regional climatic differences, ensuring the model's adaptability across different geographic regions. Additionally, we propose a geometric multimodal focal loss to further enhance representation learning. On the Tick-Tick Bloom (TTB) dataset, our approach outperforms the SOTA methods by 15.65%.
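The abstract names a geometric multimodal focal loss but does not define it. As background only, the following is a minimal sketch of the standard multi-class focal loss that such variants typically build on; it is not the loss proposed in the paper. The function names, the five-class logit vector, and the default `gamma=2.0` are assumptions chosen for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of raw scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def focal_loss(logits, target, gamma=2.0):
    """Standard focal loss for a single example (NOT the paper's variant).

    The cross-entropy term -log(p_t) is scaled by (1 - p_t) ** gamma, so
    examples the model already classifies confidently contribute less.
    """
    p_t = softmax(logits)[target]
    p_t = max(p_t, 1e-12)  # guard against log(0) on extreme logits
    return -((1.0 - p_t) ** gamma) * math.log(p_t)


def cross_entropy(logits, target):
    """Plain cross-entropy, i.e. focal loss with gamma = 0."""
    return focal_loss(logits, target, gamma=0.0)


# Hypothetical logits over five classes (illustrative values only).
logits = [4.0, 0.0, 0.0, 0.0, 0.0]
easy = focal_loss(logits, target=0)  # confident and correct: heavily down-weighted
hard = focal_loss(logits, target=1)  # confident but wrong: close to cross-entropy
```

With `gamma = 0` the expression reduces to plain cross-entropy, which makes the down-weighting of easy examples straightforward to check against a baseline.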
Pages: 17-18
Page count: 2