Doubled coupling for image emotion distribution learning

被引：4

作者：

Wu, Huiyan ^{[1
]}

Huang, Yonggang ^{[1
]}

Nan, Guoshun ^{[2
]}

机构：

[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100081, Peoples R China

[2] Beijing Univ Posts & Telecommun, Sch Cyber Sci & Engn, Beijing 100876, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2023年 / 260卷

基金：

中国国家自然科学基金;

关键词：

Image emotion distribution; Object coupling; Image coupling; DCGCN; Dynamic iteration; ALGORITHM;

D O I：

10.1016/j.knosys.2022.110107

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Image emotion prediction has a great impact on wide applications, such as social network analysis, advertising, and human-computer interaction. Recently, image emotion distribution learning (IEDL) has attracted increasing attention as it holds the potential to tackle the challenging emotion ambiguity problem for image emotion prediction. Existing efforts focus more on the emotion distribution learning with the assumption of independently identically distribution. However, we observe that the connections between objects in an image (e.g., butterfly and flower) and the connections between different images (e.g., the images taken in the same place), commonly exist in real-world datasets. Coupling information has been proved greatly helpful for many tasks, and also is crucial for image emotion analysis. Such observations motivate us to explore the above two coupling relations for better IEDL. With this in mind, we propose DoubledIEDL, a novel IEDL approach that consists of two sub -modules for object and image coupling learning, respectively. Specifically, our IEDL relies on a unified framework equipped with densely connected graph convolutional networks (DCGCN) for both coupling learning. The learning of our proposed framework has two stages: static stage and dynamic stage. In the first stage, a static graph is constructed to extract the shallow coupling information with DCGCN. Then, in the second stage, the deep coupling information is further mined via DCGCN on dynamically updated graphs in an iterative manner. The sub-modules for object and image coupling learning share this framework, but differ in the static graph constructing strategy. Extensive experiments on the two public benchmarks, FlickrLDL and TwitterLDL, demonstrate the effectiveness of the proposed DoubledIEDL, yielding significant improvement against previous state-of-the-art models. On FlickrLDL, CoupledIEDL achieves 0.8596 in Cosine and 0.4356 in Kullback-Leibler Divergence (K-L). On TwitterLDL, CoupledIEDL achieves 0.8717 in Cosine and 0.4705 in K-L.(c) 2022 Elsevier B.V. All rights reserved.

引用

页数：11

共 50 条

[1] Predicting Image Emotion Distribution by Emotional Region
Fan, Yangyu
Yang, Hansen
Li, Zuhe
Liu, Shu
2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
[2] Ordinal margin metric learning and its extension for cross-distribution image data
Tian, Qing
Chen, Songcan
Qiao, Lishan
INFORMATION SCIENCES, 2016, 349 : 50 - 64
[3] Hybrid quantum-classical generative adversarial networks for image generation via learning discrete distribution
Zhou, Nan-Run
Zhang, Tian-Feng
Xie, Xin-Wen
Wu, Jun-Yun
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 110
[4] Survey of Deep Representation Learning for Speech Emotion Recognition
Latif, Siddique
Rana, Rajib
Khalifa, Sara
Jurdak, Raja
Qadir, Junaid
Schuller, Bjorn
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) : 1634 - 1654
[5] Auto Encoder Feature Learning with Utilization of Local Spatial Information and Data Distribution for Classification of PolSAR Image
Hou, Biao
Wang, Jianlong
Jiao, Licheng
Wang, Shuang
REMOTE SENSING, 2019, 11 (11)
[6] Dictionary learning for image prediction
Turkan, Mehmet
Guillemot, Christine
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2013, 24 (03) : 426 - 437
[7] PREDICTION-BASED LEARNING FOR CONTINUOUS EMOTION RECOGNITION IN SPEECH
Han, Jing
Zhang, Zixing
Ringeval, Fabien
Schuller, Bjorn
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5005 - 5009
[8] Multimodal emotion recognition system for e-learning platform
Vani, R. K. Kapila
Jayashree, P.
EDUCATION AND INFORMATION TECHNOLOGIES, 2025,
[9] Learning Non-local Image Diffusion for Image Denoising
Qiao, Peng
Dou, Yong
Feng, Wensen
Li, Rongchun
Chen, Yunjin
PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1847 - 1855
[10] Image Transformation Based on Learning Dictionaries across Image Spaces
Jia, Kui
Wang, Xiaogang
Tang, Xiaoou
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (02) : 367 - 380

← 1 2 3 4 5 →