BSMM-Net: Multi-modal neural network based on bilateral symmetry for nasopharyngeal carcinoma segmentation

被引:0
作者
Zhou, Haoyang [1 ,3 ]
Li, Haojiang [2 ]
Chen, Shuchao [1 ]
Yang, Shixin [1 ]
Ruan, Guangying [2 ]
Liu, Lizhi [2 ]
Chen, Hongbo [1 ]
机构
[1] Guilin Univ Elect Technol, Guangxi Coll & Univ, Sch Life & Environm Sci, Key Lab Biomed Sensors & Intelligent Instruments, Guilin, Guangxi, Peoples R China
[2] Sun Yat sen Univ, Collaborat Innovat Ctr Canc Med, State Key Lab Oncol South China, Canc Ctr SYSUCC, Guanghzou, Guangdong, Peoples R China
[3] Guilin Univ Aerosp Technol, Sch Elect Informat & Automat, Guilin, Guangxi, Peoples R China
来源
FRONTIERS IN HUMAN NEUROSCIENCE | 2023年 / 16卷
基金
中国国家自然科学基金;
关键词
segmentation; MRI; neural network; multi-modal; nasopharyngeal carcinoma; LESION SEGMENTATION; ATTENTION; IMAGES;
D O I
10.3389/fnhum.2022.1068713
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
IntroductionAutomatically and accurately delineating the primary nasopharyngeal carcinoma (NPC) tumors in head magnetic resonance imaging (MRI) images is crucial for patient staging and radiotherapy. Inspired by the bilateral symmetry of head and complementary information of different modalities, a multi-modal neural network named BSMM-Net is proposed for NPC segmentation. MethodsFirst, a bilaterally symmetrical patch block (BSP) is used to crop the image and the bilaterally flipped image into patches. BSP can improve the precision of locating NPC lesions and is a simulation of radiologist locating the tumors with the bilateral difference of head in clinical practice. Second, modality-specific and multi-modal fusion features (MSMFFs) are extracted by the proposed MSMFF encoder to fully utilize the complementary information of T1- and T2-weighted MRI. The MSMFFs are then fed into the base decoder to aggregate representative features and precisely delineate the NPC. MSMFF is the output of MSMFF encoder blocks, which consist of six modality-specific networks and one multi-modal fusion network. Except T1 and T2, the other four modalities are generated from T1 and T2 by the BSP and DT modal generate block. Third, the MSMFF decoder with similar structure to the MSMFF encoder is deployed to supervise the encoder during training and assure the validity of the MSMFF from the encoder. Finally, experiments are conducted on the dataset of 7633 samples collected from 745 patients. Results and discussionThe global DICE, precision, recall and IoU of the testing set are 0.82, 0.82, 0.86, and 0.72, respectively. The results show that the proposed model is better than the other state-of-the-art methods for NPC segmentation. In clinical diagnosis, the BSMM-Net can give precise delineation of NPC, which can be used to schedule the radiotherapy.
引用
收藏
页数:12
相关论文
共 37 条
  • [1] A deep learning approach to segmentation of nasopharyngeal carcinoma using computed tomography
    Bai, Xiaoyu
    Hu, Yan
    Gong, Guanzhong
    Yin, Yong
    Xia, Yong
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 64
  • [2] Optimization with Soft Dice Can Lead to a Volumetric Bias
    Bertels, Jeroen
    Robben, David
    Vandermeulen, Dirk
    Suetens, Paul
    [J]. BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES (BRAINLES 2019), PT I, 2020, 11992 : 89 - 97
  • [3] Combining Images and T-Staging Information to Improve the Automatic Segmentation of Nasopharyngeal Carcinoma Tumors in MR Images
    Cai, Mingwei
    Wang, Jiazhou
    Yang, Qing
    Guo, Ying
    Zhang, Zhen
    Ying, Hongmei
    Hu, Weigang
    Hu, Chaosu
    [J]. IEEE ACCESS, 2021, 9 : 21323 - 21331
  • [4] Cao H., 2021, arXiv, DOI DOI 10.48550/ARXIV.2105.05537
  • [5] Nasopharyngeal carcinoma segmentation using a region growing technique
    Chanapai, Weerayuth
    Bhongmakapat, Thongchai
    Tuntiyatorn, Lojana
    Ritthipravat, Panrasee
    [J]. INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2012, 7 (03) : 413 - 422
  • [6] MMFNet: A multi-modality MRI fusion network for segmentation of nasopharyngeal carcinoma
    Chen, Huai
    Qi, Yuxiao
    Yin, Yong
    Li, Tengxiang
    Liu, Xiaoqing
    Li, Xiuli
    Gong, Guanzhong
    Wang, Lisheng
    [J]. NEUROCOMPUTING, 2020, 394 : 27 - 40
  • [7] Chen J., 2021, arXiv
  • [8] Semantic segmentation in medical images through transfused convolution and transformer networks
    Dhamija, Tashvik
    Gupta, Anunay
    Gupta, Shreyansh
    Anjum
    Katarya, Rahul
    Singh, Ghanshyam
    [J]. APPLIED INTELLIGENCE, 2023, 53 (01) : 1132 - 1148
  • [9] Image segmentation of nasopharyngeal carcinoma using 3D CNN with long-range skip connection and multi-scale feature pyramid
    Guo, Feng
    Shi, Canghong
    Li, Xiaojie
    Wu, Xi
    Zhou, Jiliu
    Lv, Jiancheng
    [J]. SOFT COMPUTING, 2020, 24 (16) : 12671 - 12680
  • [10] Hatamizadeh A., 2021, arXiv