MLMT-CNN for object detection and segmentation in multi-layer and multi-spectral images

Cited by: 3
Authors
Almahasneh, Majedaldein [1]
Paiement, Adeline [2]
Xie, Xianghua [1]
Aboudarham, Jean [3]
Affiliations
[1] Swansea Univ, Dept Comp Sci, Swansea, W Glam, Wales
[2] Univ Toulon & Var, Aix Marseille Univ, LIS, CNRS, Marseille, France
[3] Observ Paris PSL, Paris, France
Keywords
Image segmentation; object detection; deep learning; weakly supervised learning; multi-spectral images; solar image analysis; solar active regions; ALGORITHM
DOI
10.1007/s00138-021-01261-y
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Precisely localising solar Active Regions (AR) from multi-spectral images is a challenging but important task in understanding solar activity and its influence on space weather. A key challenge is that each modality captures a different location of the 3D objects, unlike typical multi-spectral imaging scenarios in which all image bands observe the same scene. We therefore refer to this special multi-spectral scenario as multi-layer. We present a multi-task deep learning framework that exploits the dependencies between image bands to produce 3D AR localisation (segmentation and detection), in which each image band (and physical location) has its own set of results. Furthermore, to address the difficulty of producing dense AR annotations for training supervised machine learning (ML) algorithms, we adapt a training strategy based on weak labels (i.e. bounding boxes) in a recursive manner. We compare our detection and segmentation stages against baseline approaches for solar image analysis (multi-channel coronal hole detection, SPOCA for ARs) and state-of-the-art deep learning methods (Faster RCNN, U-Net). Additionally, both detection and segmentation stages are quantitatively validated on artificially created data of similar spatial configurations made from annotated multi-modal magnetic resonance images. On the artificial dataset, our framework achieves an average of 0.72 IoU (segmentation) and 0.90 F1 score (detection) across all modalities, compared with 0.53 and 0.58, respectively, for the best-performing baseline methods. On the AR detection task, it achieves an F1 score of 0.84, compared with a baseline of 0.82. Our segmentation results are qualitatively validated by an expert on real ARs.
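The two metrics quoted in the abstract, IoU for segmentation and F1 score for detection, can be illustrated with a minimal Python sketch. This is not the authors' evaluation code: the function names (`mask_iou`, `f1_score`, `mean_over_bands`), the representation of masks as nested lists of 0/1 values, and the convention for empty masks are all assumptions made for illustration. The abstract does not state how detections are matched to ground truth, so true positives, false positives, and false negatives are taken here as given counts.

```python
def mask_iou(pred, target):
    """Intersection over Union of two binary masks (nested lists of 0/1)."""
    inter = 0
    union = 0
    for row_p, row_t in zip(pred, target):
        for p, t in zip(row_p, row_t):
            inter += 1 if (p and t) else 0
            union += 1 if (p or t) else 0
    # Assumed convention: two empty masks count as perfect agreement.
    return inter / union if union else 1.0


def f1_score(tp, fp, fn):
    """F1 from detection counts; equals 2PR / (P + R)."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def mean_over_bands(scores):
    """Average a per-band (per-modality) score, as in 'across all modalities'."""
    return sum(scores) / len(scores)


# Hypothetical two-band example: one mask pair per image band.
band_a = mask_iou([[1, 1], [0, 0]], [[1, 0], [0, 0]])
band_b = mask_iou([[1, 1], [1, 1]], [[1, 1], [1, 1]])
average_iou = mean_over_bands([band_a, band_b])
```

Averaging per band rather than pooling all pixels reflects the multi-layer setting described in the abstract, where each band has its own set of results; whether the paper averages in exactly this way is an assumption.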
Pages: 15