Masked autoencoder of multi-scale convolution strategy combined with knowledge distillation for facial beauty prediction

被引:0
作者
Gan, Junying [1 ]
Xiong, Junling [1 ]
机构
[1] Wuyi Univ, Sch Elect Informat Engn, Jiangmen 529020, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1038/s41598-025-86831-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Facial beauty prediction (FBP) is a leading area of research in artificial intelligence. Currently, there is a small amount of labeled data and a large amount of unlabeled data in the FBP database. The features extracted by the model based on supervised training are limited, resulting in low prediction accuracy. Masked autoencoder (MAE) is a self-supervised learning method that outperforms supervised learning methods without relying on large-scale databases. The MAE can improve the feature extraction ability of the model effectively. The multi-scale convolution strategy can expand the receptive field and combine the attention mechanism of the MAE to capture the dependency between distant pixels and acquire shallow and deep image features. Knowledge distillation can take the abundant knowledge from the teacher net to the student net, reduce the number of parameters, and compress the model. In this paper, the MAE of the multi-scale convolution strategy is combined with knowledge distillation for FBP. First, the MAE model with a multi-scale convolution strategy is constructed and used in the teacher net for pretraining. Second, the MAE model is constructed for the student net. Finally, the teacher net performs knowledge distillation, and the student net receives the loss function transmitted from the teacher net for optimization. The experimental results show that the proposed method outperforms other methods on the FBP task, improves FBP accuracy, and can be widely applied in tasks such as image classification.
引用
收藏
页数:17
相关论文
共 50 条
[11]   Knowledge Distillation Anomaly Detection with Multi-Scale Feature Fusion [J].
Yadang C. ;
Liuren C. ;
Wenbin Y. ;
Jiale Z. .
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (10) :1542-1549
[12]   Multi-scale Feature Extraction and Fusion for Online Knowledge Distillation [J].
Zou, Panpan ;
Teng, Yinglei ;
Niu, Tao .
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 :126-138
[13]   A multi-scale and multi-objective optimization strategy for catalytic distillation process [J].
Wang, Qinglian ;
Yang, Zhuo ;
Wang, Jianan ;
Huang, Zhixian ;
Yang, Chen ;
Wang, Hongxing ;
Qiu, Ting .
CHEMICAL ENGINEERING SCIENCE, 2023, 265
[14]   Study of EEG classification of depression by multi-scale convolution combined with the Transformer [J].
Zhai F.-W. ;
Sun F. ;
Jin J. .
Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2024, 51 (02) :182-195
[15]   Multi-scale spatiotemporal graph convolution network for air quality prediction [J].
Liang Ge ;
Kunyan Wu ;
Yi Zeng ;
Feng Chang ;
Yaqian Wang ;
Siyu Li .
Applied Intelligence, 2021, 51 :3491-3505
[16]   Multi-scale spatiotemporal graph convolution network for air quality prediction [J].
Ge, Liang ;
Wu, Kunyan ;
Zeng, Yi ;
Chang, Feng ;
Wang, Yaqian ;
Li, Siyu .
APPLIED INTELLIGENCE, 2021, 51 (06) :3491-3505
[17]   HYPERSPECTRAL IMAGE CHANGE DETECTION BASED ON MULTI-SCALE 3D CONVOLUTION AUTOENCODER [J].
Tang, Yingjie ;
Fan, Yuanze ;
Feng, Shou ;
Zhao, Chunhui ;
Luo, Tianfang .
2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, :3211-3214
[18]   Joint weighted knowledge distillation and multi-scale feature distillation for long-tailed recognition [J].
Yiru He ;
Shiqian Wang ;
Junyang Yu ;
Chaoyang Liu ;
Xin He ;
Han Li .
International Journal of Machine Learning and Cybernetics, 2024, 15 :1647-1661
[19]   Joint weighted knowledge distillation and multi-scale feature distillation for long-tailed recognition [J].
He, Yiru ;
Wang, Shiqian ;
Yu, Junyang ;
Liu, Chaoyang ;
He, Xin ;
Li, Han .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (04) :1647-1661
[20]   BUS-M2AE: Multi-scale Masked Autoencoder for Breast Ultrasound Image Analysis [J].
Yu, Le ;
Gou, Bo ;
Xia, Xun ;
Yang, Yujia ;
Yi, Zhang ;
Min, Xiangde ;
He, Tao .
Computers in Biology and Medicine, 2025, 191