TSRNet: Tongue image segmentation with global and local refinement

Citations: 13
Authors
Cai, Wenjun [1]
Zhang, Mengjian [1]
Wen, Guihua [1]
Yang, Pei [1]
Affiliation
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510000, Peoples R China
Funding
U.S. National Science Foundation;
Keywords
Deep learning; Tongue image segmentation; Global refinement; Local refinement; Intelligent traditional Chinese medicine; CHINESE; DIAGNOSIS;
DOI
10.1016/j.displa.2023.102601
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Tongue image segmentation is an essential task for intelligent Traditional Chinese Medicine (TCM), as the tongue is sensitive to the physiological conditions and pathological changes of patients and can help physicians determine strategies for syndrome differentiation. However, acquiring an accurate tongue segmentation mask is challenging because of the varied shape and texture of the tongue. This paper proposes a novel tongue segmentation network based on an encoder-decoder framework with global and local refinement, named TSRNet. In the global refinement module, we design an effective module for fusing features from an autoencoder, which is pre-trained on tongue images with segmentation labels, so that the network can make use of the prior knowledge. Moreover, in the local refinement module, we perform patch sampling according to the coarse prediction boundary and correct errors through a patch segmentation module. Both modules are plugged into the decoder and trained end-to-end to obtain better tongue segmentation results. Experimental comparisons with state-of-the-art models on two real-world tongue datasets demonstrate the effectiveness of the proposed TSRNet.
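The abstract says only that patches are sampled along the boundary of the coarse prediction, so the following is an illustrative sketch of that general idea and not the authors' implementation. It assumes a binary coarse mask, 4-connectivity for boundary detection, and a greedy minimum-spacing rule for choosing patch centers. The function names `boundary_pixels` and `sample_patches` and the parameters `patch` and `min_dist` are hypothetical and do not come from the paper.

```python
from typing import List, Tuple

Mask = List[List[int]]  # binary mask: 1 = tongue, 0 = background


def boundary_pixels(mask: Mask) -> List[Tuple[int, int]]:
    """Foreground pixels that touch background or the image edge (4-connectivity)."""
    h, w = len(mask), len(mask[0])
    out = []
    for r in range(h):
        for c in range(w):
            if mask[r][c] != 1:
                continue
            for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                nr, nc = r + dr, c + dc
                if not (0 <= nr < h and 0 <= nc < w) or mask[nr][nc] == 0:
                    out.append((r, c))
                    break
    return out


def sample_patches(
    mask: Mask, patch: int = 4, min_dist: int = 3
) -> List[Tuple[int, int, int, int]]:
    """Pick boundary pixels at least `min_dist` apart (Chebyshev distance) and
    return patch boxes (top, left, bottom, right), bottom/right exclusive,
    shifted where needed so they stay inside the image."""
    h, w = len(mask), len(mask[0])
    centers: List[Tuple[int, int]] = []
    for r, c in boundary_pixels(mask):  # row-major scan order
        if all(max(abs(r - cr), abs(c - cc)) >= min_dist for cr, cc in centers):
            centers.append((r, c))
    half = patch // 2
    boxes = []
    for r, c in centers:
        top = min(max(r - half, 0), max(h - patch, 0))
        left = min(max(c - half, 0), max(w - patch, 0))
        boxes.append((top, left, min(top + patch, h), min(left + patch, w)))
    return boxes
```

In a pipeline of the kind the abstract describes, each returned box would be cropped from the image and the coarse prediction and passed to a patch-level segmentation model, whose output would replace the coarse prediction inside that box. That refinement step is not shown here.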
Pages: 9