Multi-scale contextual semantic enhancement network for 3D medical image segmentation

被引:1
|
作者
Xia, Tingjian [1 ]
Huang, Guoheng [1 ]
Pun, Chi-Man [2 ]
Zhang, Weiwen [1 ]
Li, Jiajian [1 ]
Ling, Wing-Kuen [3 ]
Lin, Chao [4 ]
Yang, Qi [4 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China
[2] Univ Macau, Dept Comp & Informat Sci, Macau 999078, Peoples R China
[3] Guangdong Univ Technol, Sch Informat Engn, Guangzhou 510006, Peoples R China
[4] Sun Yat sen Univ, State Key Lab Oncol South China, Collaborat Innovat Ctr Canc Med, Dept Nasopharyngeal,Carcinoma Guangdong Key Lab Na, Guangzhou 510060, Peoples R China
来源
PHYSICS IN MEDICINE AND BIOLOGY | 2022年 / 67卷 / 22期
关键词
3D medical image segmentation; nasopharyngeal carcinoma; liver tumor; multi-scale context; feature enhancement; class imbalance; NEURAL-NETWORKS; NET;
D O I
10.1088/1361-6560/ac9e41
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Objective. Accurate and automatic segmentation of medical images is crucial for improving the efficiency of disease diagnosis and making treatment plans. Although methods based on convolutional neural networks have achieved excellent results in numerous segmentation tasks of medical images, they still suffer from challenges including drastic scale variations of lesions, blurred boundaries of lesions and class imbalance. Our objective is to design a segmentation framework named multi-scale contextual semantic enhancement network (3D MCSE-Net) to address the above problems. Approach. The 3D MCSE-Net mainly consists of a multi-scale context pyramid fusion module (MCPFM), a triple feature adaptive enhancement module (TFAEM), and an asymmetric class correction loss (ACCL) function. Specifically, the MCPFM resolves the problem of unreliable predictions due to variable morphology and drastic scale variations of lesions by capturing the multi-scale global context of feature maps. Subsequently, the TFAEM overcomes the problem of blurred boundaries of lesions caused by the infiltrating growth and complex context of lesions by adaptively recalibrating and enhancing the multi-dimensional feature representation of suspicious regions. Moreover, the ACCL alleviates class imbalances by adjusting asy mmetric correction coefficient and weighting factor. Main results. Our method is evaluated on the nasopharyngeal cancer tumor segmentation (NPCTS) dataset, the public dataset of the MICCAI 2017 liver tumor segmentation (LiTS) challenge and the 3D image reconstruction for comparison of algorithm and DataBase (3Dircadb) dataset to verify its effectiveness and generalizability. The experimental results show the proposed components all have unique strengths and exhibit mutually reinforcing properties. More importantly, the proposed 3D MCSE-Net outperforms previous state-of-the-art methods for tumor segmentation on the NPCTS, LiTS and 3Dircadb dataset. Significance. Our method addresses the effects of drastic scale variations of lesions, blurred boundaries of lesions and class imbalance, and improves tumors segmentation accuracy, which facilitates clinical medical diagnosis and treatment planning.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] Semantic Segmentation of Remote Sensing Image Based on Multi-Scale Semantic Encoder-Decoder Network
    Liang Y.
    Yi C.-X.
    Wang G.-Y.
    Hu Y.-H.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (11): : 3199 - 3214
  • [42] MCPA: multi-scale cross perceptron attention network for 2D medical image segmentation
    Xu, Liang
    Chen, Mingxiao
    Cheng, Yi
    Song, Pengwu
    Shao, Pengfei
    Shen, Shuwei
    Yao, Peng
    Xu, Ronald X.
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (01)
  • [43] An Efficient Multi-Scale Fusion Network for 3D Organs at Risk (OARs) Segmentation
    Srivastava, Abhishek
    Jha, Debesh
    Keles, Elif
    Aydogan, Bulent
    Abazeed, Mohamed
    Bagci, Ulas
    2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [44] Contextual Multi-scale Region Convolutional 3D Network for Anomalous Activity Detection in Videos
    Santhi, M.
    Sunny, Leya Elizabeth
    COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 98 - 108
  • [45] Image scene classification based on multi-scale and contextual semantic information
    Zhang, Rui-Jie
    Li, Bi-Cheng
    Wei, Fu-Shan
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2014, 42 (04): : 646 - 652
  • [46] Multi-Class Multi-Scale Series Contextual Model for Image Segmentation
    Seyedhosseini, Mojtaba
    Tasdizen, Tolga
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (11) : 4486 - 4496
  • [47] 3D morphometry of Martian craters from HRSC DEMs using a multi-scale semantic segmentation network and morphological analysis
    Ye, Peiqi
    Huang, Rong
    Xu, Yusheng
    Li, Wendi
    Ye, Zhen
    Tong, Xiaohua
    ICARUS, 2025, 426
  • [48] 3D multi-scale level set segmentation of vertebrae
    Tan, Sovira
    Yao, Jianhua
    Ward, Michael M.
    Yao, Lawrence
    Summers, Ronald M.
    2007 4TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING : MACRO TO NANO, VOLS 1-3, 2007, : 896 - 899
  • [49] SEMI-SUPERVISED MEDICAL IMAGE SEMANTIC SEGMENTATION WITH MULTI-SCALE GRAPH CUT LOSS
    Sun, Junxiao
    Zhang, Yan
    Zhu, Jian
    Wu, Jiasong
    Kong, Youyong
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 624 - 628
  • [50] Adaptive multi-scale dual attention network for semantic segmentation
    Wang, Weizhen
    Wang, Suyu
    Li, Yue
    Jin, Yishu
    NEUROCOMPUTING, 2021, 460 : 39 - 49