DualBranch-FusionNet: A Hybrid CNN-Transformer Architecture for Cervical Cell Image Classification

Times cited: 0

Authors:

Xu, Chuanyun [1]

Huang, Shuaiye [1]

Zhang, Yang [1]

Hu, Die [1]

Sun, Yisha [1]

Li, Gang [2]

Affiliations:

[1] Chongqing Normal Univ, Sch Comp & Informat Sci, Chongqing, Peoples R China

[2] Chongqing Univ Technol, Sch Artificial Intelligence, Chongqing, Peoples R China

Source:

INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY | 2025 / Vol. 35 / Issue 03

Keywords:

cervical cancer; convolutional neural network; hybrid architecture; image classification; attention

DOI:

10.1002/ima.70101

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]

Discipline classification code:

0808; 0809

Abstract:

Cervical cancer screening relies on accurate cell classification. Approaches based on Convolutional Neural Networks (CNNs) have proven effective for this task, but they face two main challenges. First, variations in cell morphology and color may introduce bias into the models. Second, because CNNs primarily focus on local pixel information, they may struggle to capture broader contextual information. To address these issues, we present a novel hybrid model named DualBranch-FusionNet, which combines CNNs for local feature extraction with Transformers for capturing global contextual information to improve cervical cell classification accuracy. The proposed method rests on three ideas. First, in the CNN branch, it introduces Omni-dimensional Dynamic Convolution (ODConv) to adaptively extract detailed features across multiple dimensions and designs an Adaptive Channel Modulation (ACM) mechanism to dynamically emphasize critical feature channels. Second, in the Transformer branch, it designs a Dynamic Query-Aware Sparse Attention (DQSA) mechanism to filter out less relevant key-value pairs over a larger receptive field, thereby reducing interference from irrelevant information. Third, it adopts a fusion strategy, the Simple Fusion Module (SFM), to produce more comprehensive feature representations, leading to improved cervical cell classification accuracy. The proposed model was validated on two datasets, the Mendeley LBC dataset and the Tianchi Cervical Cancer Risk Intelligent Diagnosis Challenge dataset, achieving accuracies of 99.07% and 99.12%, respectively.
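
The abstract describes the attention mechanism only at a high level: less relevant key-value pairs are filtered out before attention is applied. One common way to realize that idea is top-k sparse attention, sketched below in plain Python. This is an illustration under stated assumptions, not the paper's DQSA: the function name `topk_sparse_attention`, the scaled dot-product scoring, the fixed `k`, and the single-query setup are all hypothetical choices made here for clarity.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def topk_sparse_attention(query, keys, values, k):
    """Attend only to the k keys that score highest against the query.

    Hypothetical illustration of key-value filtering; the abstract does
    not specify how DQSA selects pairs, so top-k is an assumption.
    Returns the attended output vector and the sorted kept indices.
    """
    if not 1 <= k <= len(keys):
        raise ValueError("k must be between 1 and the number of keys")
    scale = math.sqrt(len(query))
    # Scaled dot-product score between the query and every key.
    scores = [sum(q * x for q, x in zip(query, key)) / scale for key in keys]
    # Keep the indices of the k highest-scoring keys and drop the rest.
    kept = sorted(range(len(keys)), key=lambda i: scores[i], reverse=True)[:k]
    # Normalize over the kept keys only, so dropped pairs get zero weight.
    weights = softmax([scores[i] for i in kept])
    dim = len(values[0])
    output = [
        sum(w * values[i][d] for w, i in zip(weights, kept))
        for d in range(dim)
    ]
    return output, sorted(kept)


# Toy example: the last two keys point away from the query and are dropped.
query = [1.0, 0.0]
keys = [[1.0, 0.0], [0.9, 0.1], [-1.0, 0.0], [0.0, -1.0]]
values = [[1.0, 0.0], [0.0, 1.0], [5.0, 5.0], [7.0, 7.0]]
output, kept = topk_sparse_attention(query, keys, values, k=2)
```

Setting `k` equal to the number of keys recovers ordinary dense attention, which makes the sparsity easy to ablate in a sketch like this. A practical implementation would operate on batched tensors and select key-value pairs per query region rather than per single query vector.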

Pages: 14
