JAMFN: Joint Attention Multi-Scale Fusion Network for Depression Detection

Times cited: 0
Authors
Zhou, Li [1]
Liu, Zhenyu [1]
Shangguan, Zixuan [1]
Yuan, Xiaoyan [1]
Li, Yutong [1]
Hu, Bin [1]
Affiliation
[1] Lanzhou University, Lanzhou, People's Republic of China
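Source
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023, 2023-August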
Funding
National Natural Science Foundation of China
Keywords
Depression detection; Vlog; Joint Attention Multi-Scale Fusion Network (JAMFN); Classification
DOI
10.21437/Interspeech.2023-183
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
As Internet use has spread, social networks have become an indispensable part of people's lives. Because social networks contain information about users' daily moods and states, they provide a new avenue for detecting depression. Most current approaches focus on the fusion of multimodal features but ignore the importance of fine-grained behavioral information. In this paper, we propose the Joint Attention Multi-Scale Fusion Network (JAMFN), a model that reflects the multiscale behavioral information of depression. Its Joint Attention Fusion (JAF) module extracts the temporal importance of multiple modalities and uses it to guide the fusion of multiscale modal pairs. Experiments on the D-vlog dataset show that JAMFN outperforms all the benchmark models, indicating that it can effectively mine potential depressive behavior.
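The abstract gives no architectural detail, so the following is only a minimal illustrative sketch of the general idea it names: weighting time steps by an importance score derived jointly from two modalities, and repeating the fusion at several temporal scales. It is not the authors' JAF module or the JAMFN model. Every name here (`avg_pool`, `joint_attention_fuse`, `multiscale_fuse`, the modality labels `audio` and `visual`, the window sizes, and the dot-product scoring rule) is a hypothetical choice made for illustration, and both modalities are assumed to share the same feature dimension.

```python
import math


def avg_pool(seq, window):
    """Average non-overlapping windows of frames to get a coarser time scale."""
    pooled = []
    for start in range(0, len(seq) - window + 1, window):
        chunk = seq[start:start + window]
        pooled.append([sum(col) / window for col in zip(*chunk)])
    return pooled


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def joint_attention_fuse(audio, visual):
    """Fuse two aligned sequences using one set of temporal weights.

    Assumption: the importance of a time step is the scaled dot product of
    the two modality features at that step (a stand-in scoring rule).
    """
    dim = len(audio[0])
    scores = [sum(a * v for a, v in zip(fa, fv)) / math.sqrt(dim)
              for fa, fv in zip(audio, visual)]
    weights = softmax(scores)
    fused = [0.0] * (2 * dim)
    for w, fa, fv in zip(weights, audio, visual):
        # Weighted sum over time of the concatenated modality features.
        for i, x in enumerate(fa + fv):
            fused[i] += w * x
    return fused, weights


def multiscale_fuse(audio, visual, windows=(1, 2, 4)):
    """Repeat the fusion at several temporal scales and concatenate."""
    out = []
    for w in windows:
        fused, _ = joint_attention_fuse(avg_pool(audio, w), avg_pool(visual, w))
        out.extend(fused)
    return out


if __name__ == "__main__":
    audio = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]]
    visual = [[0.5, 0.5], [1.0, 0.0], [1.0, 1.0], [0.0, 1.0]]
    print(len(multiscale_fuse(audio, visual)))
```

With four frames of two-dimensional features per modality and the three hypothetical window sizes, the result is a single vector with 3 × 4 entries. A trained model would learn the scoring function rather than use a fixed dot product.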
Pages: 3417-3421
Page count: 5
Related papers
  • [1] JAMFN: Joint Attention Multi-Scale Fusion Network for Depression Detection
    Zhou, Li
    Liu, Zhenyu
    Shangguan, Zixuan
    Yuan, Xiaoyan
    Li, Yutong
    Hu, Bin
    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023, 2023-August : 3417 - 3421