MLMT-CNN for object detection and segmentation in multi-layer and multi-spectral images

被引：3

作者：

Almahasneh, Majedaldein ^{[1
]}

Paiement, Adeline ^{[2
]}

Xie, Xianghua ^{[1
]}

Aboudarham, Jean ^{[3
]}

机构：

[1] Swansea Univ, Dept Comp Sci, Swansea, W Glam, Wales

[2] Univ Toulon & Var, Aix Marseille Univ, LIS, CNRS, Marseille, France

[3] Observ Paris PSL, Paris, France

来源：

MACHINE VISION AND APPLICATIONS | 2022年 / 33卷 / 01期

关键词：

Image segmentation; object detection; deep learning; weakly supervised learning; multi-spectral images; solar image analysis; solar active regions; ALGORITHM;

D O I：

10.1007/s00138-021-01261-y

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Precisely localising solar Active Regions (AR) from multi-spectral images is a challenging but important task in understanding solar activity and its influence on space weather. A main challenge comes from each modality capturing a different location of the 3D objects, as opposed to typical multi-spectral imaging scenarios where all image bands observe the same scene. Thus, we refer to this special multi-spectral scenario as multi-layer. We present a multi-task deep learning framework that exploits the dependencies between image bands to produce 3D AR localisation (segmentation and detection) where different image bands (and physical locations) have their own set of results. Furthermore, to address the difficulty of producing dense AR annotations for training supervised machine learning (ML) algorithms, we adapt a training strategy based on weak labels (i.e. bounding boxes) in a recursive manner. We compare our detection and segmentation stages against baseline approaches for solar image analysis (multi-channel coronal hole detection, SPOCA for ARs) and state-of-the-art deep learning methods (Faster RCNN, U-Net). Additionally, both detection and segmentation stages are quantitatively validated on artificially created data of similar spatial configurations made from annotated multi-modal magnetic resonance images. Our framework achieves an average of 0.72 IoU (segmentation) and 0.90 F1 score (detection) across all modalities, comparing to the best performing baseline methods with scores of 0.53 and 0.58, respectively, on the artificial dataset, and 0.84 F1 score in the AR detection task comparing to baseline of 0.82 F1 score. Our segmentation results are qualitatively validated by an expert on real ARs.

引用

页数：15

共 37 条

[21] SSD: Single Shot MultiBox Detector
Liu, Wei
Anguelov, Dragomir
Erhan, Dumitru
Szegedy, Christian
Reed, Scott
Fu, Cheng-Yang
Berg, Alexander C.
[J]. COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 21 - 37
[22] The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS)
Menze, Bjoern H.
Jakab, Andras
Bauer, Stefan
Kalpathy-Cramer, Jayashree
Farahani, Keyvan
Kirby, Justin
Burren, Yuliya
Porz, Nicole
Slotboom, Johannes
Wiest, Roland
Lanczi, Levente
Gerstner, Elizabeth
Weber, Marc-Andre
Arbel, Tal
Avants, Brian B.
Ayache, Nicholas
Buendia, Patricia
Collins, D. Louis
Cordier, Nicolas
Corso, Jason J.
Criminisi, Antonio
Das, Tilak
Delingette, Herve
Demiralp, Cagatay
Durst, Christopher R.
Dojat, Michel
Doyle, Senan
Festa, Joana
Forbes, Florence
Geremia, Ezequiel
Glocker, Ben
Golland, Polina
Guo, Xiaotao
Hamamci, Andac
Iftekharuddin, Khan M.
Jena, Raj
John, Nigel M.
Konukoglu, Ender
Lashkari, Danial
Mariz, Jose Antonio
Meier, Raphael
Pereira, Sergio
Precup, Doina
Price, Stephen J.
Raviv, Tammy Riklin
Reza, Syed M. S.
Ryan, Michael
Sarikaya, Duygu
Schwartz, Lawrence
Shin, Hoo-Chang
[J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2015, 34 (10) : 1993 - 2024
[23] Mohajerani S., 2018, IEEE MMSP
[24] Mohajerani S, 2019, INT GEOSCI REMOTE SE, P1029, DOI [10.1109/igarss.2019.8898776, 10.1109/IGARSS.2019.8898776]
[25] Penatti OAB, 2015, IEEE COMPUT SOC CONF
[26] You Only Look Once: Unified, Real-Time Object Detection
Redmon, Joseph
Divvala, Santosh
Girshick, Ross
Farhadi, Ali
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 779 - 788
[27] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Ren, Shaoqing
He, Kaiming
Girshick, Ross
Sun, Jian
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) : 1137 - 1149
[28] Fractal-based fuzzy technique for detection of active regions from solar images
Revathy, K
Lekshmi, S
Nayar, SRP
[J]. SOLAR PHYSICS, 2005, 228 (1-2) : 43 - 53
[29] Ronneberger O., 2015, P MED IM COMP COMP A, P234, DOI DOI 10.48550/ARXIV.1505.04597
[30] Shelhamer E., 2017, IEEE TPAMI

← 1 2 3 4 →