Doubled coupling for image emotion distribution learning

被引:4
|
作者
Wu, Huiyan [1 ]
Huang, Yonggang [1 ]
Nan, Guoshun [2 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100081, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Cyber Sci & Engn, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Image emotion distribution; Object coupling; Image coupling; DCGCN; Dynamic iteration; ALGORITHM;
D O I
10.1016/j.knosys.2022.110107
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image emotion prediction has a great impact on wide applications, such as social network analysis, advertising, and human-computer interaction. Recently, image emotion distribution learning (IEDL) has attracted increasing attention as it holds the potential to tackle the challenging emotion ambiguity problem for image emotion prediction. Existing efforts focus more on the emotion distribution learning with the assumption of independently identically distribution. However, we observe that the connections between objects in an image (e.g., butterfly and flower) and the connections between different images (e.g., the images taken in the same place), commonly exist in real-world datasets. Coupling information has been proved greatly helpful for many tasks, and also is crucial for image emotion analysis. Such observations motivate us to explore the above two coupling relations for better IEDL. With this in mind, we propose DoubledIEDL, a novel IEDL approach that consists of two sub -modules for object and image coupling learning, respectively. Specifically, our IEDL relies on a unified framework equipped with densely connected graph convolutional networks (DCGCN) for both coupling learning. The learning of our proposed framework has two stages: static stage and dynamic stage. In the first stage, a static graph is constructed to extract the shallow coupling information with DCGCN. Then, in the second stage, the deep coupling information is further mined via DCGCN on dynamically updated graphs in an iterative manner. The sub-modules for object and image coupling learning share this framework, but differ in the static graph constructing strategy. Extensive experiments on the two public benchmarks, FlickrLDL and TwitterLDL, demonstrate the effectiveness of the proposed DoubledIEDL, yielding significant improvement against previous state-of-the-art models. On FlickrLDL, CoupledIEDL achieves 0.8596 in Cosine and 0.4356 in Kullback-Leibler Divergence (K-L). On TwitterLDL, CoupledIEDL achieves 0.8717 in Cosine and 0.4705 in K-L.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Predicting Image Emotion Distribution by Emotional Region
    Fan, Yangyu
    Yang, Hansen
    Li, Zuhe
    Liu, Shu
    2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
  • [2] Ordinal margin metric learning and its extension for cross-distribution image data
    Tian, Qing
    Chen, Songcan
    Qiao, Lishan
    INFORMATION SCIENCES, 2016, 349 : 50 - 64
  • [3] Hybrid quantum-classical generative adversarial networks for image generation via learning discrete distribution
    Zhou, Nan-Run
    Zhang, Tian-Feng
    Xie, Xin-Wen
    Wu, Jun-Yun
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 110
  • [4] Survey of Deep Representation Learning for Speech Emotion Recognition
    Latif, Siddique
    Rana, Rajib
    Khalifa, Sara
    Jurdak, Raja
    Qadir, Junaid
    Schuller, Bjorn
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) : 1634 - 1654
  • [5] Auto Encoder Feature Learning with Utilization of Local Spatial Information and Data Distribution for Classification of PolSAR Image
    Hou, Biao
    Wang, Jianlong
    Jiao, Licheng
    Wang, Shuang
    REMOTE SENSING, 2019, 11 (11)
  • [6] Dictionary learning for image prediction
    Turkan, Mehmet
    Guillemot, Christine
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2013, 24 (03) : 426 - 437
  • [7] PREDICTION-BASED LEARNING FOR CONTINUOUS EMOTION RECOGNITION IN SPEECH
    Han, Jing
    Zhang, Zixing
    Ringeval, Fabien
    Schuller, Bjorn
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5005 - 5009
  • [8] Multimodal emotion recognition system for e-learning platform
    Vani, R. K. Kapila
    Jayashree, P.
    EDUCATION AND INFORMATION TECHNOLOGIES, 2025,
  • [9] Learning Non-local Image Diffusion for Image Denoising
    Qiao, Peng
    Dou, Yong
    Feng, Wensen
    Li, Rongchun
    Chen, Yunjin
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1847 - 1855
  • [10] Image Transformation Based on Learning Dictionaries across Image Spaces
    Jia, Kui
    Wang, Xiaogang
    Tang, Xiaoou
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (02) : 367 - 380