FairViT: Fair Vision Transformer via Adaptive Masking

被引:0
作者
Tian, Bowei [1 ]
Du, Ruijie [2 ]
Shen, Yanning [2 ]
机构
[1] Wuhan Univ, Wuhan 430072, Hubei, Peoples R China
[2] Univ Calif Irvine, Irvine, CA 92697 USA
来源
COMPUTER VISION - ECCV 2024, PT LXV | 2025年 / 15123卷
关键词
Vision Transformer; Accuracy; Fairness; Adaptive Masking;
D O I
10.1007/978-3-031-73650-6_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vision Transformer (ViT) has achieved excellent performance and demonstrated its promising potential in various computer vision tasks. The wide deployment of ViT in real-world tasks requires a thorough understanding of the societal impact of the model. However, most ViT-based works do not take fairness into account and it is unclear whether directly applying CNN-oriented debiased algorithm to ViT is feasible. Moreover, previous works typically sacrifice accuracy for fairness. Therefore, we aim to develop an algorithm that improves accuracy without sacrificing fairness. In this paper, we propose FairViT, a novel accurate and fair ViT framework. To this end, we introduce a novel distance loss and deploy adaptive fairness-aware masks on attention layers updating with model parameters. Experimental results show FairViT can achieve accuracy better than other alternatives, even with competitive computational efficiency. Furthermore, FairViT achieves appreciable fairness results.
引用
收藏
页码:451 / 466
页数:16
相关论文
共 50 条
[41]   Vision Conformer: Incorporating Convolutions into Vision Transformer Layers [J].
Iwana, Brian Kenji ;
Kusuda, Akihiro .
DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2023, PT IV, 2023, 14190 :54-69
[42]   Customized Transformer Adapter With Frequency Masking for Deepfake Detection [J].
Shi, Zenan ;
Chen, Haipeng ;
Jia, Yixin ;
Zhang, Dong ;
Lu, Wei ;
Yang, Xun .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 :5904-5918
[43]   Fast and robust face recognition via coding residual map learning based adaptive masking [J].
Yang, Meng ;
Feng, Zhizhao ;
Shiu, Simon C. K. ;
Zhang, Lei .
PATTERN RECOGNITION, 2014, 47 (02) :535-543
[44]   AITFuse: Infrared and visible image fusion via adaptive interactive transformer learning [J].
Wang, Zhishe ;
Yang, Fan ;
Sun, Jing ;
Xu, Jiawei ;
Yang, Fengbao ;
Yan, Xiaomei .
KNOWLEDGE-BASED SYSTEMS, 2024, 299
[45]   Trustworthy and Fair Federated Learning via Reputation-Based Consensus and Adaptive Incentives [J].
Rashid, Md Mamunur ;
Xiang, Yong ;
Uddin, Md Palash ;
Tang, Jine ;
Sood, Keshav ;
Gao, Longxiang .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 20 :2868-2882
[46]   Taylor-Series-Expansion-Based Vision Transformer Models [J].
Yu, Chong ;
Chen, Tao ;
Gan, Zhongxue .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (09) :8213-8230
[47]   Vision Transformer and Brain Connectivity Patterns for Estimating Cognitive States [J].
Das Chakladar, Debashis .
IEEE ACCESS, 2025, 13 :74602-74612
[48]   ROPGCViT: A Novel Explainable Vision Transformer for Retinopathy of Prematurity Diagnosis [J].
Yurdakul, Mustafa ;
Uyar, Kubra ;
Tasdemir, Sakir ;
Atabas, Irfan .
IEEE ACCESS, 2025, 13 :77064-77079
[49]   CardSegNet: An adaptive hybrid CNN-vision transformer model for heart region segmentation in cardiac MRI [J].
Aghapanah, Hamed ;
Rasti, Reza ;
Kermani, Saeed ;
Tabesh, Faezeh ;
Banaem, Hossein Yousefi ;
Aliakbar, Hamidreza Pour ;
Sanei, Hamid ;
Segars, William Paul .
COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2024, 115
[50]   A Hyperspectral Image Classification Method Based on Adaptive Spectral Spatial Kernel Combined with Improved Vision Transformer [J].
Wang, Aili ;
Xing, Shuang ;
Zhao, Yan ;
Wu, Haibin ;
Iwahori, Yuji .
REMOTE SENSING, 2022, 14 (15)