A self-supervised image aesthetic assessment combining masked image modeling and contrastive learning

被引：0

作者：

Yang, Shuai ^{[1
,2
]}

Wang, Zibei ^{[1
]}

Wang, Guangao ^{[1
]}

Ke, Yongzhen ^{[1
,2
,4
]}

Qin, Fan ^{[3
]}

Guo, Jing ^{[1
,2
]}

Chen, Liming ^{[5
]}

机构：

[1] Tiangong Univ, Sch Comp Sci & Technol, Tianjin, Peoples R China

[2] Tianjin Key Lab Autonomous Intelligence Technol &, Tianjin, Peoples R China

[3] Nankai Univ, Business Sch, Tianjin, Peoples R China

[4] Tiangong Univ, Natl Demonstrat Ctr Expt Engn Training Educ, Tianjin, Peoples R China

[5] Fitow Tianjin Detect Technol Co LTD, Tianjin, Peoples R China

来源：

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2024年 / 101卷

关键词：

Image aesthetic assessment; Self-supervised learning; Masked image modeling;

D O I：

10.1016/j.jvcir.2024.104184

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Learning more abundant image features helps improve the image aesthetic assessment task performance. Masked Image Modeling (MIM) is implemented based on the Vision Transformer (ViT), which learns pixel-level features while reconstructing images. Contrastive learning pulls in the same image features while pushing away different image features in the feature space to learn high-level semantic features. Since contrastive learning and MIM capture different levels of image features, combining these two methods could learn more rich feature representations and thus promote the performance of aesthetic assessment. Therefore, we propose a pretext task combining contrastive learning and MIM with learning richer image features. In this approach, the original image is randomly masked and reconstructed on the online network. The reconstructed and original images composition the positive example to calculate the contrastive loss on the target network. In the experiment on the AVA dataset, our method obtained better performance than the baseline.

引用

页数：7

共 50 条

[1] Pathological Image Contrastive Self-supervised Learning
Qin, Wenkang
Jiang, Shan
Luo, Lin
RESOURCE-EFFICIENT MEDICAL IMAGE ANALYSIS, REMIA 2022, 2022, 13543 : 85 - 94
[2] ATTENTION-GUIDED CONTRASTIVE MASKED IMAGE MODELING FOR TRANSFORMER-BASED SELF-SUPERVISED LEARNING
Zhan, Yucheng
Zhao, Yucheng
Luo, Chong
Zhang, Yueyi
Sun, Xiaoyan
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2490 - 2494
[3] Revisiting Image Aesthetic Assessment via Self-Supervised Feature Learning
Sheng, Kekai
Dong, Weiming
Chai, Menglei
Wang, Guohui
Zhou, Peng
Huang, Feiyue
Hu, Bao-Gang
Ji, Rongrong
Ma, Chongyang
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 5709 - 5716
[4] Towards Latent Masked Image Modeling for Self-supervised Visual Representation Learning
Wei, Yibing
Gupta, Abhinav
Morgado, Pedro
COMPUTER VISION - ECCV 2024, PT XXXIX, 2025, 15097 : 1 - 17
[5] Masked Image Modeling as a Framework for Self-Supervised Learning Across Eye Movements
Weiler, Robin
Brucklacher, Matthias
Pennartz, Cyriel M. A.
Bohte, Sander M.
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT IV, 2024, 15019 : 17 - 31
[6] Self-Supervised Image Aesthetic Assessment Based on Transformer
Jia, Minrui
Wang, Guangao
Wang, Zibei
Yang, Shuai
Ke, Yongzhen
Wang, Kai
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2025, 24 (01)
[7] Image classification framework based on contrastive self-supervised learning
Zhao H.-W.
Zhang J.-R.
Zhu J.-P.
Li H.
Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2022, 52 (08): : 1850 - 1856
[8] Similarity contrastive estimation for image and video soft contrastive self-supervised learning
Denize, Julien
Rabarisoa, Jaonary
Orcesi, Astrid
Herault, Romain
MACHINE VISION AND APPLICATIONS, 2023, 34 (06)
[9] Masked Modeling-Based Ultrasound Image Classification via Self-Supervised Learning
Xu, Kele
You, Kang
Zhu, Boqing
Feng, Ming
Feng, Dawei
Yang, Cheng
IEEE OPEN JOURNAL OF ENGINEERING IN MEDICINE AND BIOLOGY, 2024, 5 : 226 - 237
[10] Similarity contrastive estimation for image and video soft contrastive self-supervised learning
Julien Denize
Jaonary Rabarisoa
Astrid Orcesi
Romain Hérault
Machine Vision and Applications, 2023, 34

← 1 2 3 4 5 →