MLMT-CNN for object detection and segmentation in multi-layer and multi-spectral images

被引：3

作者：

Almahasneh, Majedaldein ^{[1
]}

Paiement, Adeline ^{[2
]}

Xie, Xianghua ^{[1
]}

Aboudarham, Jean ^{[3
]}

机构：

[1] Swansea Univ, Dept Comp Sci, Swansea, W Glam, Wales

[2] Univ Toulon & Var, Aix Marseille Univ, LIS, CNRS, Marseille, France

[3] Observ Paris PSL, Paris, France

来源：

MACHINE VISION AND APPLICATIONS | 2022年 / 33卷 / 01期

关键词：

Image segmentation; object detection; deep learning; weakly supervised learning; multi-spectral images; solar image analysis; solar active regions; ALGORITHM;

D O I：

10.1007/s00138-021-01261-y

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Precisely localising solar Active Regions (AR) from multi-spectral images is a challenging but important task in understanding solar activity and its influence on space weather. A main challenge comes from each modality capturing a different location of the 3D objects, as opposed to typical multi-spectral imaging scenarios where all image bands observe the same scene. Thus, we refer to this special multi-spectral scenario as multi-layer. We present a multi-task deep learning framework that exploits the dependencies between image bands to produce 3D AR localisation (segmentation and detection) where different image bands (and physical locations) have their own set of results. Furthermore, to address the difficulty of producing dense AR annotations for training supervised machine learning (ML) algorithms, we adapt a training strategy based on weak labels (i.e. bounding boxes) in a recursive manner. We compare our detection and segmentation stages against baseline approaches for solar image analysis (multi-channel coronal hole detection, SPOCA for ARs) and state-of-the-art deep learning methods (Faster RCNN, U-Net). Additionally, both detection and segmentation stages are quantitatively validated on artificially created data of similar spatial configurations made from annotated multi-modal magnetic resonance images. Our framework achieves an average of 0.72 IoU (segmentation) and 0.90 F1 score (detection) across all modalities, comparing to the best performing baseline methods with scores of 0.53 and 0.58, respectively, on the artificial dataset, and 0.84 F1 score in the AR detection task comparing to baseline of 0.82 F1 score. Our segmentation results are qualitatively validated by an expert on real ARs.

引用

页数：15

共 37 条

[1] Almahasneh M., 2021, ICPRAM
[2] [Anonymous], 2016, CVPR
[3] [Anonymous], 2015, ICCVW
[4] Bansal A., 2017, ARXIV PREPRINT ARXIV
[5] Benkhalil A., 2006, SOLAR PHY
[6] BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation
Dai, Jifeng
He, Kaiming
Sun, Jian
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1635 - 1643
[7] Dai J, 2016, PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), P1796, DOI 10.1109/ICIT.2016.7475036
[8] Dalal N., 2005, P IEEE C COMP VIS PA
[9] Eitel A, 2015, IEEE INT C INT ROBOT, P681, DOI 10.1109/IROS.2015.7353446
[10] Gani M., 2021, MULTISPECTRAL OBJECT

← 1 2 3 4 →