Multimodal Chinese Agricultural News Classification Method Based on Interactive Attention

被引:1
|
作者
Duan, Xuliang [1 ]
Li, Zhiyao [2 ]
Liu, Lingqi
Liu, Yuhai
机构
[1] Sichuan Agr Univ, Sch Informat Engn, Yaan 625014, Sichuan, Peoples R China
[2] Key Lab Agr Informat Engn Sichuan Prov, Yaan 625014, Sichuan, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Fake news; Data models; Agricultural machinery; Visualization; Training; Attention mechanisms; Semantics; Fisheries; Annotations; Multimedia computing; Multimodal learning; multimodal classification; multimodal Chinese agricultural news dataset; interactive attention mechanism; attention mechanism; feature fusion; Chinese agricultural news classification; Chinese agricultural news;
D O I
10.1109/ACCESS.2024.3482868
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most current research on Chinese agricultural news is limited to text analysis and seldom integrates images, leading to a scarcity of multimodal Chinese agricultural news datasets and an evident gap in multimodal Chinese agricultural news research. To address this, we propose the VECO method, a novel multimodal Chinese agricultural news classification approach that leverages interactive attention mechanisms. This algorithm uses ERNIE for text feature extraction and ViT(Vision Transformer) for image feature extraction, focusing on the interplay of features across modalities to uncover the congruent emotional content present in both the images and text. The integrated features are merged with individual image and text features and subsequently processed through a softmax layer to determine the classification outcomes. Our experiments, conducted on an in-house multimodal Chinese agricultural news dataset, demonstrate that the VECO method outperforms the baseline model, with improvements of 3.27% in precision, 0.59% in recall, and 1.92% in f1-score. The multimodal classification of Chinese agricultural news yields superior performance compared to text-only classification, and the results of the VECO model are notably better than those of other multimodal classification models. Future research can focus on optimizing the multimodal feature fusion algorithm to adapt to more complex agricultural news scenarios.
引用
收藏
页码:161718 / 161731
页数:14
相关论文
共 50 条
  • [41] HIAN: A hybrid interactive attention network for multimodal sarcasm detection
    Bao, Yongtang
    Zhao, Xin
    Zhang, Peng
    Qi, Yue
    Li, Haojie
    PATTERN RECOGNITION, 2025, 164
  • [42] Multi-hop interactive attention based classification network for expert recommendation
    Qian, Lingfei
    Wang, Jian
    Lin, Hongfei
    Yang, Liang
    Zhang, Yu
    NEUROCOMPUTING, 2022, 488 : 436 - 443
  • [43] A Temporal-and-Spatial Flow Based Multimodal Fake News Detection by Pooling and Attention Blocks
    Guo, Ying
    Song, Wei
    IEEE ACCESS, 2022, 10 : 131498 - 131508
  • [44] Multimodal Pre-Training Based on Graph Attention Network for Document Understanding
    Zhang, Zhenrong
    Ma, Jiefeng
    Du, Jun
    Wang, Licheng
    Zhang, Jianshu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 6743 - 6755
  • [45] AMNN: Attention-Based Multimodal Neural Network Model for Hashtag Recommendation
    Yang, Qi
    Wu, Gaosheng
    Li, Yuhua
    Li, Ruixuan
    Gu, Xiwu
    Deng, Huicai
    Wu, Junzhuang
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2020, 7 (03) : 768 - 779
  • [46] SANet: A Self-Attention Network for Agricultural Hyperspectral Image Classification
    Zhang, Bo
    Chen, Yaxiong
    Li, Zhiheng
    Xiong, Shengwu
    Lu, Xiaoqiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [47] Gated attention fusion network for multimodal sentiment classification
    Du, Yongping
    Liu, Yang
    Peng, Zhi
    Jin, Xingnan
    KNOWLEDGE-BASED SYSTEMS, 2022, 240
  • [48] Network Intrusion Detection Method Based on CNN-BiLSTM-Attention Model
    Dai, Wei
    Li, Xinhui
    Ji, Wenxin
    He, Sicheng
    IEEE ACCESS, 2024, 12 : 53099 - 53111
  • [49] An Aeromagnetic Compensation Method Based on Attention Mechanism
    Ma, Xiaoyu
    Zhang, Jinsheng
    Liao, Shouyi
    Li, Ting
    Li, Zehao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [50] Multimodal Co-training for Fake News Identification Using Attention-aware Fusion
    Das Bhattacharjee, Sreyasee
    Yuan, Junsong
    PATTERN RECOGNITION, ACPR 2021, PT II, 2022, 13189 : 282 - 296