SpineParseNet: Spine Parsing for Volumetric MR Image by a Two-Stage Segmentation Framework With Semantic Image Representation

被引:111
作者
Pang, Shumao [1 ]
Pang, Chunlan [2 ]
Zhao, Lei [1 ]
Chen, Yangfan [1 ]
Su, Zhihai [3 ]
Zhou, Yujia [1 ]
Huang, Meiyan [1 ]
Yang, Wei [1 ]
Lu, Hai [3 ]
Feng, Qianjin [1 ]
机构
[1] Southern Med Univ, Sch Biomed Engn, Guangdong Prov Key Lab Med Image Proc, Guangzhou 510515, Peoples R China
[2] Sun Yat Sen Univ, Dept Nucl Med, Canc Ctr, Guangzhou 510060, Peoples R China
[3] Sun Yat Sen Univ, Affiliated Hosp 5, Dept Spinal Surg, Zhuhai 519000, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Image segmentation; Image representation; Semantics; Computed tomography; Three-dimensional displays; Two dimensional displays; Learning systems; Deep learning; spine; 3D segmentation; graph convolution; 3D; LOCALIZATION;
D O I
10.1109/TMI.2020.3025087
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Spine parsing (i.e., multi-class segmentation of vertebrae and intervertebral discs (IVDs)) for volumetric magnetic resonance (MR) image plays a significant role in various spinal disease diagnoses and treatments of spine disorders, yet is still a challenge due to the inter-class similarity and intra-class variation of spine images. Existing fully convolutional network based methods failed to explicitly exploit the dependencies between different spinal structures. In this article, we propose a novel two-stage framework named SpineParseNet to achieve automated spine parsing for volumetric MR images. The SpineParseNet consists of a 3D graph convolutional segmentation network (GCSN) for 3D coarse segmentation and a 2D residual U-Net (ResUNet) for 2D segmentation refinement. In 3D GCSN, region pooling is employed to project the image representation to graph representation, in which each node representation denotes a specific spinal structure. The adjacency matrix of the graph is designed according to the connection of spinal structures. The graph representation is evolved by graph convolutions. Subsequently, the proposed region unpooling module re-projects the evolved graph representation to a semantic image representation, which facilitates the 3D GCSN to generate reliable coarse segmentation. Finally, the 2D ResUNet refines the segmentation. Experiments on T2-weighted volumetric MR images of 215 subjects show that SpineParseNet achieves impressive performance with mean Dice similarity coefficients of 87.32 +/- 4.75%, 87.78 +/- 4.64%, and 87.49 +/- 3.81% for the segmentations of 10 vertebrae, 9 IVDs, and all 19 spinal structures respectively. The proposed method has great potential in clinical spinal disease diagnoses and treatments.
引用
收藏
页码:262 / 273
页数:12
相关论文
共 35 条
[1]  
[Anonymous], 2017, Superhuman Accuracy on the SNEMI3D Connectomics Challenge
[2]   Localization and Segmentation of 3D Intervertebral Discs in MR Images by Data Driven Estimation [J].
Chen, Cheng ;
Belavy, Daniel ;
Yu, Weimin ;
Chu, Chengwen ;
Armbrecht, Gabriele ;
Bansmann, Martin ;
Felsenberg, Dieter ;
Zheng, Guoyan .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2015, 34 (08) :1719-1729
[3]   3D registration based perception in augmented reality environment [J].
Chen, Heen ;
He, Hanwu ;
Mo, Jianqing ;
Li, Jinfang ;
Yang, Xian .
COGENT ENGINEERING, 2016, 3 (01)
[4]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[5]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[6]  
Cicek O., 2016, INT C MEDICAL IMAGE, DOI [10.1007/978-3-319-46723-8_49, DOI 10.1007/978-3-319-46723-8_49]
[7]   Simultaneous Volumetric Segmentation of Vertebral Bodies and Intervertebral Discs on Fat-Water MR Images [J].
Fallah, Faezeh ;
Walter, Sven Stephan ;
Bamberg, Fabian ;
Yang, Bin .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2019, 23 (04) :1692-1701
[8]  
Forsberg D., 2015, LECT NOTES COMPUTATI, P215
[9]  
Gong K, 2019, PROC CVPR IEEE, P7442, DOI [10.1109/cvpr.2019.00763, 10.1109/CVPR.2019.00763]
[10]  
Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.8.1735, 10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]