Multimodal Chinese Agricultural News Classification Method Based on Interactive Attention

被引:1
|
作者
Duan, Xuliang [1 ]
Li, Zhiyao [2 ]
Liu, Lingqi
Liu, Yuhai
机构
[1] Sichuan Agr Univ, Sch Informat Engn, Yaan 625014, Sichuan, Peoples R China
[2] Key Lab Agr Informat Engn Sichuan Prov, Yaan 625014, Sichuan, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Fake news; Data models; Agricultural machinery; Visualization; Training; Attention mechanisms; Semantics; Fisheries; Annotations; Multimedia computing; Multimodal learning; multimodal classification; multimodal Chinese agricultural news dataset; interactive attention mechanism; attention mechanism; feature fusion; Chinese agricultural news classification; Chinese agricultural news;
D O I
10.1109/ACCESS.2024.3482868
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most current research on Chinese agricultural news is limited to text analysis and seldom integrates images, leading to a scarcity of multimodal Chinese agricultural news datasets and an evident gap in multimodal Chinese agricultural news research. To address this, we propose the VECO method, a novel multimodal Chinese agricultural news classification approach that leverages interactive attention mechanisms. This algorithm uses ERNIE for text feature extraction and ViT(Vision Transformer) for image feature extraction, focusing on the interplay of features across modalities to uncover the congruent emotional content present in both the images and text. The integrated features are merged with individual image and text features and subsequently processed through a softmax layer to determine the classification outcomes. Our experiments, conducted on an in-house multimodal Chinese agricultural news dataset, demonstrate that the VECO method outperforms the baseline model, with improvements of 3.27% in precision, 0.59% in recall, and 1.92% in f1-score. The multimodal classification of Chinese agricultural news yields superior performance compared to text-only classification, and the results of the VECO model are notably better than those of other multimodal classification models. Future research can focus on optimizing the multimodal feature fusion algorithm to adapt to more complex agricultural news scenarios.
引用
收藏
页码:161718 / 161731
页数:14
相关论文
共 50 条
  • [21] Attention-Based Multimodal Deep Learning on Vision-Language Data: Models, Datasets, Tasks, Evaluation Metrics and Applications
    Bose, Priyankar
    Rana, Pratip
    Ghosh, Preetam
    IEEE ACCESS, 2023, 11 : 80624 - 80646
  • [22] Multimodal Bilinear Fusion Network With Second-Order Attention-Based Channel Selection for Land Cover Classification
    Li, Xiao
    Lei, Lin
    Sun, Yuli
    Li, Ming
    Kuang, Gangyao
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 1011 - 1026
  • [23] Modality Perception Learning-Based Determinative Factor Discovery for Multimodal Fake News Detection
    Wang, Boyue
    Wu, Guangchao
    Li, Xiaoyan
    Gao, Junbin
    Hu, Yongli
    Yin, Baocai
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [24] A mutual attention based multimodal fusion for fake news detection on social network
    Guo, Ying
    APPLIED INTELLIGENCE, 2023, 53 (12) : 15311 - 15320
  • [25] Multimodal Semantics-Based Supervised Latent Dirichlet Allocation for Event Classification
    Miao, Naiyang
    Xue, Feng
    Hong, Richang
    IEEE MULTIMEDIA, 2021, 28 (04) : 8 - 17
  • [26] Hierarchical Attention Learning for Multimodal Classification
    Zou, Xin
    Tang, Chang
    Zhang, Wei
    Sun, Kun
    Jiang, Liangxiao
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 936 - 941
  • [27] A mutual attention based multimodal fusion for fake news detection on social network
    Ying Guo
    Applied Intelligence, 2023, 53 : 15311 - 15320
  • [28] A Multimodal Sentiment Analysis Approach Based on a Joint Chained Interactive Attention Mechanism
    Qiu, Keyuan
    Zhang, Yingjie
    Zhao, Jiaxu
    Zhang, Shun
    Wang, Qian
    Chen, Feng
    ELECTRONICS, 2024, 13 (10)
  • [29] Multistructure Graph Classification Method With Attention-Based Pooling
    Xu, Yuhua
    Wang, Junli
    Guang, Mingjian
    Yan, Chungang
    Jiang, Changjun
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (02) : 602 - 613
  • [30] LD-MAN: Layout-Driven Multimodal Attention Network for Online News Sentiment Recognition
    Guo, Wenya
    Zhang, Ying
    Cai, Xiangrui
    Meng, Lei
    Yang, Jufeng
    Yuan, Xiaojie
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1785 - 1798