An Automatic Glioma Segmentation System Using a Multilevel Attention Pyramid Scene Parsing Network

被引:17
作者
Zhang, Zhenyu [1 ]
Gao, Shouwei [1 ]
Huang, Zheng [2 ,3 ,4 ,5 ]
机构
[1] Shanghai Univ, Sch Mechatron Engn & Automat, Shanghai, Peoples R China
[2] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang, Peoples R China
[3] Chinese Acad Sci, Inst Robot, Shenyang, Peoples R China
[4] Chinese Acad Sci, Inst Intelligent Mfg, Shenyang, Peoples R China
[5] Univ Chinese Acad Sci, Beijing, Peoples R China
关键词
Gliomas; segmentation; magnetic resonance imaging; MLAPSPNet; attention gates; feature fusion; context; MODIFIED U-NET; MRI; FEATURES;
D O I
10.2174/1573405616666201231100623
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background: Due to the significant variances in their shape and size, it is a challenging task to automatically segment gliomas. To improve the performance of glioma segmentation tasks, this paper proposed a multilevel attention pyramid scene parsing network (MLAPSPNet) that aggregates the multiscale context and multilevel features. Methods: First, T1 pre-contrast, T2-weighted fluid-attenuated inversion recovery (FLAIR) and T1 post-contrast sequences of each slice are combined to form the input. Afterwards, image normalization and augmentation techniques are applied to accelerate the training process and avoid overfitting, respectively. Furthermore, the proposed MLAPSPNet that introduces multilevel pyramid pooling modules (PPMs) and attention gates is constructed. Eventually, the proposed network is compared with some existing networks. Results: The dice similarity coefficient (DSC), sensitivity and Jaccard score of the proposed system can reach 0.885, 0.933 and 0.8, respectively. The introduction of multilevel pyramid pooling modules and attention gates can improve the DSC by 0.029 and 0.022, respectively. Moreover, compared with Res-UNet, Dense-UNet, residual channel attention UNet (RCA-UNet), DeepLab V3+ and UNet++, the DSC is improved by 0.032, 0.026, 0.014, 0.041 and 0.011, respectively. Conclusion: The proposed multilevel attention pyramid scene parsing network can achieve stateof-the-art performance, and the introduction of multilevel pyramid pooling modules and attention gates can improve the performance of glioma segmentation tasks.
引用
收藏
页码:751 / 761
页数:11
相关论文
共 48 条
[1]  
An FP, 2019, BIOMED SIGNAL PROCES, P47
[2]  
[Anonymous], 2012, P 26 ANN C NEUR INF
[3]   Bi-Directional ConvLSTM U-Net with Densley Connected Convolutions [J].
Azad, Reza ;
Asadi-Aghbolaghi, Maryam ;
Fathy, Mahmood ;
Escalera, Sergio .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :406-415
[4]   Association of genomic subtypes of lower-grade gliomas with shape features automatically extracted by a deep learning algorithm [J].
Buda, Mateusz ;
Saha, Ashirbani ;
Mazurowski, Maciej A. .
COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 109 :218-225
[5]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[6]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[7]   Dual-force convolutional neural networks for accurate brain tumor segmentation [J].
Chen, Shengcong ;
Ding, Changxing ;
Liu, Minfeng .
PATTERN RECOGNITION, 2019, 88 :90-100
[8]   The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository [J].
Clark, Kenneth ;
Vendt, Bruce ;
Smith, Kirk ;
Freymann, John ;
Kirby, Justin ;
Koppel, Paul ;
Moore, Stephen ;
Phillips, Stanley ;
Maffitt, David ;
Pringle, Michael ;
Tarbox, Lawrence ;
Prior, Fred .
JOURNAL OF DIGITAL IMAGING, 2013, 26 (06) :1045-1057
[9]   PsLSNet: Automated psoriasis skin lesion segmentation using modified U-Net-based fully convolutional network [J].
Dash, Manoranjan ;
Londhe, Narendra D. ;
Ghosh, Subhojit ;
Semwal, Ashish ;
Sonawane, Rajendra S. .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2019, 52 :226-237
[10]   Relaxed triangle inequality ratio of the Sorensen-Dice and Tversky indexes [J].
Gragera, Alonso ;
Suppakitpaisarn, Vorapong .
THEORETICAL COMPUTER SCIENCE, 2018, 718 :37-45