Bidirectional feature fusion via cross-attention transformer for chrysanthemum classification

Times cited: 0
Authors
Chen, Yifan [1]
Yang, Xichen [1]
Yan, Hui [2, 3]
Liu, Jia [3]
Jiang, Jian [1]
Mao, Zhongyuan [1]
Wang, Tianshu [4]
Affiliations
[1] Nanjing Normal Univ, Sch Comp & Elect Informat, Sch Artificial Intelligence, Nanjing 210046, Jiangsu, Peoples R China
[2] Natl & Local Collaborat Engn Ctr Chinese Med Resou, Nanjing, Peoples R China
[3] Nanjing Univ Chinese Med, Jiangsu Collaborat Innovat Ctr Chinese Med Resourc, Nanjing 210023, Jiangsu, Peoples R China
[4] Nanjing Univ Chinese Med, Coll Artificial Intelligence & Informat Technol, Nanjing 210023, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Chrysanthemums classification; Swin transformer; Two-stream network; Cross attention; Deep learning;
DOI
10.1007/s10044-025-01419-8
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Chrysanthemums hold significant ornamental, economic, and medicinal value, with their quality and economic worth heavily influenced by geographic origin. Accurate classification of chrysanthemums is crucial for ensuring product authenticity, boosting consumer trust, and promoting sustainable industry growth. Traditional classification methods, however, suffer from inefficiency and high costs. To address these challenges, we propose a novel chrysanthemum classification method utilizing a bidirectional feature fusion approach via cross-attention and two-stream network fusion. Our method preprocesses front and back images of chrysanthemums from diverse regions, employing the powerful Swin Transformer as the backbone to extract features. The cross-attention mechanism effectively integrates features from both image sides, and a secondary training strategy further enhances the model's generalization capabilities. Experimental results demonstrate that our method achieves higher accuracy, precision, recall, and F1 score compared to state-of-the-art models, highlighting its potential for accurate chrysanthemum origin tracing. The code and datasets are openly available at https://github.com/dart-into/CCMCAM, ensuring transparency and reproducibility of our findings.
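The bidirectional fusion described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (see the linked repository for that); it only shows the general idea of two cross-attention passes, one in which front-image tokens attend to back-image tokens and one in the reverse direction. The abstract does not specify the fusion details, so the residual connection, mean pooling, and concatenation below are assumptions, as are all function names, the token count, and the feature width. Random arrays stand in for the token features a Swin Transformer backbone would produce.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def cross_attention(queries, context, w_q, w_k, w_v):
    """Single-head cross-attention: `queries` tokens attend to `context` tokens."""
    q = queries @ w_q
    k = context @ w_k
    v = context @ w_v
    scores = q @ k.T / np.sqrt(q.shape[-1])  # scaled dot-product scores
    return softmax(scores, axis=-1) @ v


def bidirectional_fusion(front, back, params):
    """Fuse two token sequences with one cross-attention pass per direction.

    Assumed design (not taken from the paper): residual add, mean pooling
    over tokens, then concatenation of the two pooled vectors.
    """
    front_attended = cross_attention(front, back, *params["front_to_back"])
    back_attended = cross_attention(back, front, *params["back_to_front"])
    front_vec = (front + front_attended).mean(axis=0)
    back_vec = (back + back_attended).mean(axis=0)
    return np.concatenate([front_vec, back_vec])


# Hypothetical sizes: 49 tokens per image side, 8 feature channels.
rng = np.random.default_rng(0)
num_tokens, dim = 49, 8
front = rng.standard_normal((num_tokens, dim))  # stand-in for front-image features
back = rng.standard_normal((num_tokens, dim))   # stand-in for back-image features

# One (W_q, W_k, W_v) triple per attention direction, randomly initialised.
params = {
    name: tuple(rng.standard_normal((dim, dim)) / np.sqrt(dim) for _ in range(3))
    for name in ("front_to_back", "back_to_front")
}

fused = bidirectional_fusion(front, back, params)
print(fused.shape)  # (16,)
```

In a full classifier, a vector like `fused` would typically feed a linear classification head, with one output per origin class.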
Pages: 16