A self-supervised image aesthetic assessment combining masked image modeling and contrastive learning

被引:0
|
作者
Yang, Shuai [1 ,2 ]
Wang, Zibei [1 ]
Wang, Guangao [1 ]
Ke, Yongzhen [1 ,2 ,4 ]
Qin, Fan [3 ]
Guo, Jing [1 ,2 ]
Chen, Liming [5 ]
机构
[1] Tiangong Univ, Sch Comp Sci & Technol, Tianjin, Peoples R China
[2] Tianjin Key Lab Autonomous Intelligence Technol &, Tianjin, Peoples R China
[3] Nankai Univ, Business Sch, Tianjin, Peoples R China
[4] Tiangong Univ, Natl Demonstrat Ctr Expt Engn Training Educ, Tianjin, Peoples R China
[5] Fitow Tianjin Detect Technol Co LTD, Tianjin, Peoples R China
关键词
Image aesthetic assessment; Self-supervised learning; Masked image modeling;
D O I
10.1016/j.jvcir.2024.104184
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning more abundant image features helps improve the image aesthetic assessment task performance. Masked Image Modeling (MIM) is implemented based on the Vision Transformer (ViT), which learns pixel-level features while reconstructing images. Contrastive learning pulls in the same image features while pushing away different image features in the feature space to learn high-level semantic features. Since contrastive learning and MIM capture different levels of image features, combining these two methods could learn more rich feature representations and thus promote the performance of aesthetic assessment. Therefore, we propose a pretext task combining contrastive learning and MIM with learning richer image features. In this approach, the original image is randomly masked and reconstructed on the online network. The reconstructed and original images composition the positive example to calculate the contrastive loss on the target network. In the experiment on the AVA dataset, our method obtained better performance than the baseline.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Pathological Image Contrastive Self-supervised Learning
    Qin, Wenkang
    Jiang, Shan
    Luo, Lin
    RESOURCE-EFFICIENT MEDICAL IMAGE ANALYSIS, REMIA 2022, 2022, 13543 : 85 - 94
  • [2] ATTENTION-GUIDED CONTRASTIVE MASKED IMAGE MODELING FOR TRANSFORMER-BASED SELF-SUPERVISED LEARNING
    Zhan, Yucheng
    Zhao, Yucheng
    Luo, Chong
    Zhang, Yueyi
    Sun, Xiaoyan
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2490 - 2494
  • [3] Revisiting Image Aesthetic Assessment via Self-Supervised Feature Learning
    Sheng, Kekai
    Dong, Weiming
    Chai, Menglei
    Wang, Guohui
    Zhou, Peng
    Huang, Feiyue
    Hu, Bao-Gang
    Ji, Rongrong
    Ma, Chongyang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 5709 - 5716
  • [4] Towards Latent Masked Image Modeling for Self-supervised Visual Representation Learning
    Wei, Yibing
    Gupta, Abhinav
    Morgado, Pedro
    COMPUTER VISION - ECCV 2024, PT XXXIX, 2025, 15097 : 1 - 17
  • [5] Masked Image Modeling as a Framework for Self-Supervised Learning Across Eye Movements
    Weiler, Robin
    Brucklacher, Matthias
    Pennartz, Cyriel M. A.
    Bohte, Sander M.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT IV, 2024, 15019 : 17 - 31
  • [6] Self-Supervised Image Aesthetic Assessment Based on Transformer
    Jia, Minrui
    Wang, Guangao
    Wang, Zibei
    Yang, Shuai
    Ke, Yongzhen
    Wang, Kai
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2025, 24 (01)
  • [7] Image classification framework based on contrastive self-supervised learning
    Zhao H.-W.
    Zhang J.-R.
    Zhu J.-P.
    Li H.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2022, 52 (08): : 1850 - 1856
  • [8] Similarity contrastive estimation for image and video soft contrastive self-supervised learning
    Denize, Julien
    Rabarisoa, Jaonary
    Orcesi, Astrid
    Herault, Romain
    MACHINE VISION AND APPLICATIONS, 2023, 34 (06)
  • [9] Masked Modeling-Based Ultrasound Image Classification via Self-Supervised Learning
    Xu, Kele
    You, Kang
    Zhu, Boqing
    Feng, Ming
    Feng, Dawei
    Yang, Cheng
    IEEE OPEN JOURNAL OF ENGINEERING IN MEDICINE AND BIOLOGY, 2024, 5 : 226 - 237
  • [10] Similarity contrastive estimation for image and video soft contrastive self-supervised learning
    Julien Denize
    Jaonary Rabarisoa
    Astrid Orcesi
    Romain Hérault
    Machine Vision and Applications, 2023, 34