HCT-net: hybrid CNN-transformer model based on a neural architecture search network for medical image segmentation

Cited by: 17
Authors
Yu, Zhihong [1]
Lee, Feifei [1,2]
Chen, Qiu [3]
Affiliations
[1] Univ Shanghai Sci & Technol, Shanghai Engn Res Ctr Assist Devices, Sch Med Instrument & Food Engn, Shanghai 200093, Peoples R China
[2] Univ Shanghai Sci & Technol, Rehabil Engn & Technol Inst, Shanghai 200093, Peoples R China
[3] Kogakuin Univ, Grad Sch Engn, Elect Engn & Elect, Tokyo 1638677, Japan
Keywords
Medical image segmentation; Convolutional neural network (CNN); Transformer; Neural architecture search (NAS)
DOI
10.1007/s10489-023-04570-z
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Many convolutional neural networks (CNNs) for medical image segmentation are designed by hand for individual tasks, which requires considerable time, labor, and domain knowledge. In addition, because of the locality of convolution, most CNNs capture only local feature information and ignore the global receptive field, so there is still much room for performance improvement. A method is therefore needed that fully captures feature information while saving time and human effort, with lower GPU memory consumption and complexity. In this paper, we propose a novel hybrid CNN-transformer model based on a neural architecture search network (HCT-Net). HCT-Net places a hybrid U-shaped CNN with a key-sampling Transformer backbone in the search space, so that contextual and long-range pixel information is considered, and it uses a single-path neural architecture search, with a flexible search space and an efficient search strategy, to simultaneously find the optimal subnetwork, comprising three types of cells, in the SuperNet. Compared with various types of medical image segmentation methods, our framework achieves competitive precision and efficiency on various datasets, and extended experiments validate its generalization to unseen datasets. These results indicate that our method is competitive and robust. The code for the method is available at .
Pages: 19990-20006
Number of pages: 17