A Thangka cultural element classification model based on self-supervised contrastive learning and MS Triplet Attention

被引：1

作者：

Tang, Wenjing ^{[1
]}

Xie, Qing ^{[1
,2
]}

机构：

[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China

[2] Minist Educ, Engn Res Ctr Intelligent Serv Technol Digital Publ, Wuhan, Peoples R China

来源：

VISUAL COMPUTER | 2024年 / 40卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Tibetan Thangka classification; Sample imbalance problem; Self-supervised contrastive learning; Gradient Harmonizing Mechanism Loss; Attention mechanism;

D O I：

10.1007/s00371-024-03397-0

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Being a significant repository of Buddhist imagery, Thangka images are valuable historical materials of Tibetan studies, which covers many domains such as Tibetan history, politics, culture, social life and even traditional medicine and astronomy. Thangka cultural element images are the essence of Thangka images. Hence, Thangka cultural element images classification is one of the most important works of knowledge representation and mining in the field of Thangka and is the foundation of digital protection of Thangka images. However, due to the limited quantity, high complexity and the intricate textures of Thangka images, the classification of Thangka images is limited to a small number of categories and coarse granularity. Thus, a novel fusion texture feature dual-branch Thangka cultural elements classification model based on the attention mechanism and self-supervised contrastive learning has been proposed in this paper. Specifically, to address the issue of insufficient labeled samples and improve the classification performance, this method utilizes a large amount of unlabeled irrelevant data to pre-train the feature extractor through self-supervised learning. During the fine-tuning stage of the downstream task, a dual-branch feature extraction structure incorporating texture features has been designed, and MS Triplet Attention proposed by us is used for the integration of important features. Additionally, to address the problem of sample imbalance and the existence of a large number of difficult samples in the Thangka cultural element dataset, the Gradient Harmonizing Mechanism Loss has been adopted, and it has been improved by introducing a self-designed adaptive mechanism. The experimental results on Thangka cultural elements dataset prove the superiority of the proposed method over the state-of-the-art methods. The source code of our proposed algorithm and the related datasets is available at https://github.com/WiniTang/MS-BiCLR.

引用

页码：3919 / 3935

页数：17

共 50 条

[1] Self-Supervised Contrastive Learning for Singing Voices
Yakura, Hiromu
Watanabe, Kento
Goto, Masataka
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1614 - 1623
[2] SWaCo: Safe Wafer Bin Map Classification With Self-Supervised Contrastive Learning
Kwak, Min Gu
Lee, Young Jae
Kim, Seoung Bum
IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2023, 36 (03) : 416 - 424
[3] Parkinson's Disease Classification with Self-supervised Learning and Attention Mechanism
Zhang, Yuchen
Lei, Haijun
Huang, Zhongwei
Zhao, Menglu
Li, Zhen
Liu, Chuan-Ming
Lei, Baiying
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4601 - 4607
[4] Modulation Recognition of Digital Signals Based on Contrastive Self-Supervised Learning
Liao, Yanping
Gao, Yang
Guo, Qiang
2024 9TH INTERNATIONAL CONFERENCE ON ELECTRONIC TECHNOLOGY AND INFORMATION SCIENCE, ICETIS 2024, 2024, : 432 - 436
[5] An Attention Self-supervised Contrastive Learning based Three-stage Model for Hand Shape Feature Representation in Cued Speech
Wang, Jianrong
Gu, Nan
Yu, Mei
Li, Xuewei
Fang, Qiang
Liu, Li
INTERSPEECH 2021, 2021, : 626 - 630
[6] Self-supervised Contrastive Learning for Predicting Game Strategies
Lee, Young Jae
Baek, Insung
Jo, Uk
Kim, Jaehoon
Bae, Jinsoo
Jeong, Keewon
Kim, Seoung Bum
INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, 2023, 542 : 136 - 147
[7] ASDC-FER: attention-guided self-supervised distilled contrastive learning for facial expression recognition
Yan, Lingyu
Yang, Jinquan
Wang, Chunzhi
JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (05)
[8] Missing nodes detection on graphs with self-supervised contrastive learning
Liu, Chen
Cao, Tingting
Zhou, Lixin
Shao, Ying
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 132
[9] Self-supervised learning representation for abnormal acoustic event detection based on attentional contrastive learning
Wei, Juan
Zhang, Qian
Ning, Weichen
DIGITAL SIGNAL PROCESSING, 2023, 142
[10] Self-supervised contrastive learning for heterogeneous graph based on multi-pretext tasks
Shuai Ma
Jian-wei Liu
Neural Computing and Applications, 2023, 35 : 10275 - 10296

← 1 2 3 4 5 →