HCT-net: hybrid CNN-transformer model based on a neural architecture search network for medical image segmentation

被引:13
|
作者
Yu, Zhihong [1 ]
Lee, Feifei [1 ,2 ]
Chen, Qiu [3 ]
机构
[1] Univ Shanghai Sci & Technol, Shanghai Engn Res Ctr Assist Devices, Sch Med Instrument & Food Engn, Shanghai 200093, Peoples R China
[2] Univ Shanghai Sci & Technol, Rehabil Engn & Technol Inst, Shanghai 200093, Peoples R China
[3] Kogakuin Univ, Grad Sch Engn, Elect Engn & Elect, Tokyo 1638677, Japan
关键词
Medical image segmentation; Convolutional neural network (CNN); Transformer; Neural architecture search (NAS);
D O I
10.1007/s10489-023-04570-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Considering that many manually designed convolutional neural networks (CNNs) for different tasks that require considerable time, labor, and domain knowledge have been designed in the medical image segmentation domain and that most CNN networks only consider local feature information while ignoring the global receptive field due to the convolution limitation, there is still much room for performance improvement. Therefore, designing a new method that can fully capture feature information and save considerable time and human energy with less GPU memory consumption and complexity is necessary. In this paper, we propose a novel hybrid CNN-transformer model based on a neural architecture search network (HCT-Net), which designs a hybrid U-shaped CNN with a key-sampling Transformer backbone that considers contextual and long-range pixel information in the search space and uses a single-path neural architecture search that contains a flexible search space and an efficient search strategy to simultaneously find the optimal subnetwork including three types of cells during SuperNet. Compared with various types of medical image segmentation methods, our framework can achieve competitive precision and efficiency on various datasets, and we also validate the generalization on unseen datasets in extended experiments. In this way, we can verify that our method is competitive and robust. The code for the method is available at .
引用
收藏
页码:19990 / 20006
页数:17
相关论文
共 50 条
  • [31] MedFCT: A Frequency Domain Joint CNN-Transformer Network for Semi-supervised Medical Image Segmentation
    Xie, Shiao
    Huang, Huimin
    Niu, Ziwei
    Lin, Lanfen
    Chen, Yen-Wei
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1913 - 1918
  • [32] UTNETPARA: A HYBRID CNN-TRANSFORMER ARCHITECTURE WITH MULTI-SCALE FUSION FOR WHOLE-SLIDE IMAGE SEGMENTATION
    Huang, Boqiang
    Ying, Jiayu
    Lyu, Ruizhi
    Schaadt, Nadine S.
    Klinkhammer, Barbara M.
    Boor, Peter
    Lotz, Johannes
    Feuerhake, Friedrich
    Merhof, Dorit
    IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI 2024, 2024,
  • [33] Image Deblurring Based on an Improved CNN-Transformer Combination Network
    Chen, Xiaolin
    Wan, Yuanyuan
    Wang, Donghe
    Wang, Yuqing
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [34] CSU-Net: A CNN-Transformer Parallel Network for Multimodal Brain Tumour Segmentation
    Chen, Yu
    Yin, Ming
    Li, Yu
    Cai, Qian
    ELECTRONICS, 2022, 11 (14)
  • [35] Wild horseshoe crab image denoising based on CNN-transformer architecture
    Han, Lili
    Liu, Xiuping
    Wang, Qingqing
    Xu, Tao
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [36] TACT: Text attention based CNN-Transformer network for polyp segmentation
    Zhao, Yiyang
    Li, Jinjiang
    Hua, Zhen
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (02)
  • [37] HmsU-Net: A hybrid multi-scale U-net based on a CNN and transformer for medical image segmentation
    Fu, Bangkang
    Peng, Yunsong
    He, Junjie
    Tian, Chong
    Sun, Xinhuan
    Wang, Rongpin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 170
  • [38] LATrans-Unet: Improving CNN-Transformer with Location Adaptive for Medical Image Segmentation
    Lin, Qiqin
    Yao, Junfeng
    Hong, Qingqi
    Cao, Xianpeng
    Zhou, Rongzhou
    Xie, Weixing
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII, 2024, 14437 : 223 - 234
  • [39] SEGTRANSVAE: HYBRID CNN - TRANSFORMER WITH REGULARIZATION FOR MEDICAL IMAGE SEGMENTATION
    Quan-Dung Pham
    Hai Nguyen-Truong
    Nam Nguyen Phuong
    Nguyen, Khoa N. A.
    Nguyen, Chanh D. T.
    Bui, Trung
    Truong, Steven Q. H.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [40] A Computationally Efficient Neural Video Compression Accelerator Based on a Sparse CNN-Transformer Hybrid Network
    Zhang, Siyu
    Mao, Wendong
    Shi, Huihong
    Wang, Zhongfeng
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,