DualBranch-FusionNet: A Hybrid CNN-Transformer Architecture for Cervical Cell Image Classification

被引:0
|
作者
Xu, Chuanyun [1 ]
Huang, Shuaiye [1 ]
Zhang, Yang [1 ]
Hu, Die [1 ]
Sun, Yisha [1 ]
Li, Gang [2 ]
机构
[1] Chongqing Normal Univ, Sch Comp & Informat Sci, Chongqing, Peoples R China
[2] Chongqing Univ Technol, Sch Artifcial Intelligence, Chongqing, Peoples R China
关键词
cervical cancer; convolutional neural network; hybrid architecture; image classification; ATTENTION;
D O I
10.1002/ima.70101
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Cervical cancer screening relies on accurate cell classification. Approaches based on Convolutional Neural Networks (CNNs) have proven effective in addressing the task. However, these approaches suffer from two main challenges. First, they may introduce bias into models due to variations in cell morphology and color. Second, they may struggle to capture broader contextual information as CNNs primarily focus on local pixel information. To address these issues, we present a novel hybrid model named DualBranch-FusionNet, which combines CNNs for local feature extraction with Transformers for capturing global contextual information to improve cervical cell classification accuracy. The proposed method adopts the three-fold ideas. First, concerning the CNN branch, it introduces Omni-dimensional Dynamic Convolution (ODConv) to adaptively extract detailed features across multiple dimensions and designs an Adaptive Channel Modulation (ACM) mechanism to dynamically emphasize critical feature channels. Second, regarding the Transformer branch, it designs a Dynamic Query-Aware Sparse Attention (DQSA) mechanism to effectively filter out less relevant key-value pairs over a larger receptive field, thereby reducing the interference of irrelevant information. Third, it adopts a fusion strategy, the Simple Fusion Module (SFM), to produce more comprehensive feature representations, leading to improved cervical cell classification accuracy. The proposed model was validated on two datasets: the Mendeley LBC and the Tianchi Cervical Cancer Risk Intelligent Diagnosis Challenge datasets, achieving Accuracies of 99.07% and 99.12%, respectively.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Polarformer: Optic Disc and Cup Segmentation Using a Hybrid CNN-Transformer and Polar Transformation
    Feng, Yaowei
    Li, Zhendong
    Yang, Dong
    Hu, Hongkai
    Guo, Hui
    Liu, Hao
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [42] Remote Sensing Image Classification Method Based on Fusion of CNN and Transformer
    Jin Chuan
    Tong Changqing
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (20)
  • [43] View-independent gait events detection using CNN-transformer hybrid network
    Jamsrandorj, Ankhzaya
    Jung, Dawoon
    Kumar, Konki Sravan
    Arshad, Muhammad Zeeshan
    Lim, Hwasup
    Kim, Jinwook
    Mun, Kyung-Ryoul
    JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 147
  • [44] RingMo-Lite: A Remote Sensing Lightweight Network With CNN-Transformer Hybrid Framework
    Wang, Yuelei
    Zhang, Ting
    Zhao, Liangjin
    Hu, Lin
    Wang, Zhechao
    Niu, Ziqing
    Cheng, Peirui
    Chen, Kaiqiang
    Zeng, Xuan
    Wang, Zhirui
    Wang, Hongqi
    Sun, Xian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 20
  • [45] TransSea: Hybrid CNN-Transformer With Semantic Awareness for 3-D Brain Tumor Segmentation
    Liu, Yu
    Ma, Yize
    Zhu, Zhiqin
    Cheng, Juan
    Chen, Xun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [46] FBDPN: CNN-Transformer hybrid feature boosting and differential pyramid network for underwater object detection
    Ji, Xun
    Chen, Shijie
    Hao, Li-Ying
    Zhou, Jingchun
    Chen, Long
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 256
  • [47] Ensemble of Hybrid CNN-ELM Model for Image Classification
    Kannojia, Suresh Prasad
    Jaiswal, Gaurav
    2018 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2018, : 538 - 541
  • [48] An Improved Hybrid CNN for Hyperspectral Image Classification
    Li, Yuting
    He, Lin
    ELEVENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2019), 2020, 11373
  • [49] An Efficient CNN Architecture for Image Classification on FPGA Accelerator
    Mujawar, Shahmustafa
    Kiran, Divya
    Ramasangu, Hariharan
    2018 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRONICS, COMPUTERS AND COMMUNICATIONS (ICAECC), 2018,
  • [50] IC Packaging Material Identification via a Hybrid Deep Learning Framework with CNN-Transformer Bidirectional Interaction
    Zhang, Chengbin
    Zhou, Xuankai
    Cai, Nian
    Zhou, Shuai
    Wang, Han
    MICROMACHINES, 2024, 15 (03)