Masked autoencoder of multi-scale convolution strategy combined with knowledge distillation for facial beauty prediction

被引:0
作者
Gan, Junying [1 ]
Xiong, Junling [1 ]
机构
[1] Wuyi Univ, Sch Elect Informat Engn, Jiangmen 529020, Guangdong, Peoples R China
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
基金
中国国家自然科学基金;
关键词
D O I
10.1038/s41598-025-86831-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Facial beauty prediction (FBP) is a leading area of research in artificial intelligence. Currently, there is a small amount of labeled data and a large amount of unlabeled data in the FBP database. The features extracted by the model based on supervised training are limited, resulting in low prediction accuracy. Masked autoencoder (MAE) is a self-supervised learning method that outperforms supervised learning methods without relying on large-scale databases. The MAE can improve the feature extraction ability of the model effectively. The multi-scale convolution strategy can expand the receptive field and combine the attention mechanism of the MAE to capture the dependency between distant pixels and acquire shallow and deep image features. Knowledge distillation can take the abundant knowledge from the teacher net to the student net, reduce the number of parameters, and compress the model. In this paper, the MAE of the multi-scale convolution strategy is combined with knowledge distillation for FBP. First, the MAE model with a multi-scale convolution strategy is constructed and used in the teacher net for pretraining. Second, the MAE model is constructed for the student net. Finally, the teacher net performs knowledge distillation, and the student net receives the loss function transmitted from the teacher net for optimization. The experimental results show that the proposed method outperforms other methods on the FBP task, improves FBP accuracy, and can be widely applied in tasks such as image classification.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Multi-Scale Residual Depthwise Separable Convolution for Metro Passenger Flow Prediction
    Li, Taoying
    Liu, Lu
    Li, Meng
    APPLIED SCIENCES-BASEL, 2023, 13 (20):
  • [22] A Transferable Multi-Scale Convolution Domain Adaptation Network for Bearings RUL Prediction
    Xu, Ziwei
    Zeng, Ying
    Huang, Hong-Zhong
    Huang, Zixing
    Deng, Zhiming
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [23] MFD-KD: Multi-Scale Frequency-Driven Knowledge Distillation
    Dai, Tao
    Huang, Rongliang
    Guo, Hang
    Wang, Jinbao
    Li, Bo
    Zhu, Zexuan
    BIG DATA MINING AND ANALYTICS, 2025, 8 (03): : 563 - 574
  • [24] Shared knowledge distillation for robust multi-scale super-resolution networks
    Na, Youngju
    Kim, Hee Hyeon
    Yoo, Seok Bong
    ELECTRONICS LETTERS, 2022, 58 (13) : 502 - 504
  • [25] A Graph-Based Multi-Scale Approach With Knowledge Distillation for WSI Classification
    Bontempo, Gianpaolo
    Bolelli, Federico
    Porrello, Angelo
    Calderara, Simone
    Ficarra, Elisa
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (04) : 1412 - 1421
  • [26] Multi-Channel Multi-Scale Convolution Attention Variational Autoencoder (MCA-VAE): An Interpretable Anomaly Detection Algorithm Based on Variational Autoencoder
    Liu, Jingwen
    Huang, Yuchen
    Wu, Dizhi
    Yang, Yuchen
    Chen, Yanru
    Chen, Liangyin
    Zhang, Yuanyuan
    SENSORS, 2024, 24 (16)
  • [27] Efficient multi-target vehicle trajectory prediction based on multi-scale graph convolution
    Gu, Xiang
    Wang, Jing
    Cheng, Dengyang
    Li, Chao
    Huang, Qiwei
    PATTERN ANALYSIS AND APPLICATIONS, 2025, 28 (01)
  • [28] Multi-scale Convolution Combined with Adaptive Bi-interval Equalization for Image Enhancement
    Lu H.-X.
    Liu Z.-B.
    Guo P.-Y.
    Pan X.-P.
    Guangzi Xuebao/Acta Photonica Sinica, 2020, 49 (10):
  • [29] Multi-scale Convolution Combined with Adaptive Bi-interval Equalization for Image Enhancement
    Lu Hao-xiang
    Liu Zhen-bing
    Guo Peng-yue
    Pan Xi-peng
    ACTA PHOTONICA SINICA, 2020, 49 (10)
  • [30] Prediction of miRNA-disease association based on heterogeneous hypergraph convolution and heterogeneous graph multi-scale convolution
    Dai, Wei
    Pang, Sifan
    He, Zhichen
    Fu, Xiaodong
    Liu, Li
    Liu, Lijun
    Yu, Ning
    HEALTH INFORMATION SCIENCE AND SYSTEMS, 2024, 13 (01):