A Thangka cultural element classification model based on self-supervised contrastive learning and MS Triplet Attention

被引：1

作者：

Tang, Wenjing ^{[1
]}

Xie, Qing ^{[1
,2
]}

机构：

[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China

[2] Minist Educ, Engn Res Ctr Intelligent Serv Technol Digital Publ, Wuhan, Peoples R China

来源：

VISUAL COMPUTER | 2024年 / 40卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Tibetan Thangka classification; Sample imbalance problem; Self-supervised contrastive learning; Gradient Harmonizing Mechanism Loss; Attention mechanism;

D O I：

10.1007/s00371-024-03397-0

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Being a significant repository of Buddhist imagery, Thangka images are valuable historical materials of Tibetan studies, which covers many domains such as Tibetan history, politics, culture, social life and even traditional medicine and astronomy. Thangka cultural element images are the essence of Thangka images. Hence, Thangka cultural element images classification is one of the most important works of knowledge representation and mining in the field of Thangka and is the foundation of digital protection of Thangka images. However, due to the limited quantity, high complexity and the intricate textures of Thangka images, the classification of Thangka images is limited to a small number of categories and coarse granularity. Thus, a novel fusion texture feature dual-branch Thangka cultural elements classification model based on the attention mechanism and self-supervised contrastive learning has been proposed in this paper. Specifically, to address the issue of insufficient labeled samples and improve the classification performance, this method utilizes a large amount of unlabeled irrelevant data to pre-train the feature extractor through self-supervised learning. During the fine-tuning stage of the downstream task, a dual-branch feature extraction structure incorporating texture features has been designed, and MS Triplet Attention proposed by us is used for the integration of important features. Additionally, to address the problem of sample imbalance and the existence of a large number of difficult samples in the Thangka cultural element dataset, the Gradient Harmonizing Mechanism Loss has been adopted, and it has been improved by introducing a self-designed adaptive mechanism. The experimental results on Thangka cultural elements dataset prove the superiority of the proposed method over the state-of-the-art methods. The source code of our proposed algorithm and the related datasets is available at https://github.com/WiniTang/MS-BiCLR.

引用

页码：3919 / 3935

页数：17

共 50 条

[41] SSA-GAT: Graph-Based Self-supervised Learning for Network Intrusion Detection
Liu, Qian
Zhang, Hui
Zhang, Youpeng
Fan, Lin
Jin, Xue
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT IX, 2024, 15024 : 476 - 491
[42] Multi-label modality enhanced attention based self-supervised deep cross-modal hashing
Zou, Xitao
Wu, Song
Zhang, Nian
Bakker, Erwin M.
KNOWLEDGE-BASED SYSTEMS, 2022, 239
[43] Self-Supervised Monocular Depth Estimation for Traffic Scenes Based on Dual Attention Mechanism and Adaptive Cost Volume
Wu G.
Liu W.
Hu J.
Cheng S.
Yang W.-X.
Sun L.-K.
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (05): : 1670 - 1678
[44] Attention-based label consistency for semi-supervised deep learning based image classification
Chen, Jiaming
Yang, Meng
Ling, Jie
NEUROCOMPUTING, 2021, 453 : 731 - 741
[45] Classification Model of Clock Drawing Test Based on Contrastive Learning Using Multi-Channel Features With Channel-Spatial Attention
Kang, Changsu
Wang, Bohyun
Lim, J. S.
IEEE ACCESS, 2024, 12 : 186466 - 186475
[46] Self-Supervised Real-World Image Denoising Based on Multi-Scale Feature Enhancement and Attention Fusion
Tang, Hailiang
Zhang, Wenxiao
Zhu, Hailin
Zhao, Ke
IEEE ACCESS, 2024, 12 : 49720 - 49734
[47] Brain Tumor Classification Based on Attention Guided Deep Learning Model
Wen Jun
Zheng Liyuan
International Journal of Computational Intelligence Systems, 15
[48] Brain Tumor Classification Based on Attention Guided Deep Learning Model
Wen Jun
Zheng Liyuan
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2022, 15 (01)
[49] GAF-MAE: A Self-Supervised Automatic Modulation Classification Method Based on Gramian Angular Field and Masked Autoencoder
Shi, Yunhao
Xu, Hua
Zhang, Yue
Qi, Zisen
Wang, Dan
IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (01) : 94 - 106
[50] A simple self-supervised learning framework with patch-based data augmentation in diagnosis of Alzheimer's disease
Gong, Haoqiang
Wang, Zhiwen
Huang, Shuaihui
Wang, Jinfeng
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 96

← 1 2 3 4 5 →