DualBranch-FusionNet: A Hybrid CNN-Transformer Architecture for Cervical Cell Image Classification

被引:0
|
作者
Xu, Chuanyun [1 ]
Huang, Shuaiye [1 ]
Zhang, Yang [1 ]
Hu, Die [1 ]
Sun, Yisha [1 ]
Li, Gang [2 ]
机构
[1] Chongqing Normal Univ, Sch Comp & Informat Sci, Chongqing, Peoples R China
[2] Chongqing Univ Technol, Sch Artifcial Intelligence, Chongqing, Peoples R China
关键词
cervical cancer; convolutional neural network; hybrid architecture; image classification; ATTENTION;
D O I
10.1002/ima.70101
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Cervical cancer screening relies on accurate cell classification. Approaches based on Convolutional Neural Networks (CNNs) have proven effective in addressing the task. However, these approaches suffer from two main challenges. First, they may introduce bias into models due to variations in cell morphology and color. Second, they may struggle to capture broader contextual information as CNNs primarily focus on local pixel information. To address these issues, we present a novel hybrid model named DualBranch-FusionNet, which combines CNNs for local feature extraction with Transformers for capturing global contextual information to improve cervical cell classification accuracy. The proposed method adopts the three-fold ideas. First, concerning the CNN branch, it introduces Omni-dimensional Dynamic Convolution (ODConv) to adaptively extract detailed features across multiple dimensions and designs an Adaptive Channel Modulation (ACM) mechanism to dynamically emphasize critical feature channels. Second, regarding the Transformer branch, it designs a Dynamic Query-Aware Sparse Attention (DQSA) mechanism to effectively filter out less relevant key-value pairs over a larger receptive field, thereby reducing the interference of irrelevant information. Third, it adopts a fusion strategy, the Simple Fusion Module (SFM), to produce more comprehensive feature representations, leading to improved cervical cell classification accuracy. The proposed model was validated on two datasets: the Mendeley LBC and the Tianchi Cervical Cancer Risk Intelligent Diagnosis Challenge datasets, achieving Accuracies of 99.07% and 99.12%, respectively.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Classification of Endoscopy and Video Capsule Images Using CNN-Transformer Model
    Subedi, Aliza
    Regmi, Smriti
    Regmi, Nisha
    Bhusal, Bhumi
    Bagci, Ulas
    Jha, Debesh
    CANCER PREVENTION, DETECTION, AND INTERVENTION, CAPTION 2024, 2025, 15199 : 26 - 36
  • [22] Multi-level wavelet network based on CNN-Transformer hybrid attention for single image deraining
    Liu, Bin
    Fang, Siyan
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (30) : 22387 - 22404
  • [23] Multi-level wavelet network based on CNN-Transformer hybrid attention for single image deraining
    Bin Liu
    Siyan Fang
    Neural Computing and Applications, 2023, 35 : 22387 - 22404
  • [24] CVM-Cervix: A hybrid cervical Pap-smear image classification framework using CNN, visual transformer and multilayer perceptron
    Liu, Wanli
    Li, Chen
    Xu, Ning
    Jiang, Tao
    Rahaman, Md Mamunur
    Sun, Hongzan
    Wu, Xiangchen
    Hu, Weiming
    Chen, Haoyuan
    Sun, Changhao
    Yao, Yudong
    Grzegorzek, Marcin
    PATTERN RECOGNITION, 2022, 130
  • [25] CVM-Cervix: A hybrid cervical Pap-smear image classification framework using CNN, visual transformer and multilayer perceptron
    Liu, Wanli
    Li, Chen
    Xu, Ning
    Jiang, Tao
    Rahaman, Md Mamunur
    Sun, Hongzan
    Wu, Xiangchen
    Hu, Weiming
    Chen, Haoyuan
    Sun, Changhao
    Yao, Yudong
    Grzegorzek, Marcin
    PATTERN RECOGNITION, 2022, 130
  • [26] DBCT-Net:A dual branch hybrid CNN-transformer network for remote sensing image fusion
    Wang, Quanli
    Jin, Xin
    Jiang, Qian
    Wu, Liwen
    Zhang, Yunchun
    Zhou, Wei
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 233
  • [27] MCT-Net: a multi-branch hybrid CNN-transformer model for medical image segmentation
    Longfeng Shen
    Liangjin Diao
    Rui Peng
    Jiacong Chen
    Zhengtian Lu
    Fangzhen Ge
    Pattern Analysis and Applications, 2025, 28 (2)
  • [28] CNN-Transformer with Stepped Distillation for Fine-Grained Visual Classification
    Xu, Qin
    Liu, Peng
    Wang, Jiahui
    Huang, Lili
    Tang, Jin
    PATTERN RECOGNITION AND COMPUTER VISION, PT IX, PRCV 2024, 2025, 15039 : 364 - 377
  • [29] Infrared and Visible Image Fusion Based on Autoencoder Composed of CNN-Transformer
    Wang, Hongmei
    Li, Lin
    Li, Chenkai
    Lu, Xuanyu
    IEEE ACCESS, 2023, 11 : 78956 - 78969
  • [30] Remote sensing image change detection based on CNN-Transformer structure
    Pan, Mengyang
    Yang, Hang
    Fan, Xianghui
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (10) : 1361 - 1379