S-Adapter: Generalizing Vision Transformer for Face Anti-Spoofing With Statistical Tokens

被引：2

作者：

Cai, Rizhao ^{[1
]}

Yu, Zitong ^{[2
]}

Kong, Chenqi ^{[1
]}

Li, Haoliang ^{[3
]}

Chen, Changsheng ^{[4
,5
]}

Hu, Yongjian ^{[6
,7
]}

Kot, Alex C. ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch EEE, ROSE Lab, Singapore 639798, Singapore

[2] Great Bay Univ, Sch Comp & Informat Technol, Shantou 523000, Peoples R China

[3] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China

[4] Shenzhen Univ, Guangdong Key Lab Intelligent Informat Proc, Shenzhen Key Lab Media Secur, State Key Lab Radiofrequency Heterogeneous Integra, Shenzhen 518060, Peoples R China

[5] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China

[6] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 511442, Peoples R China

[7] China Singapore Int Joint Res Inst, Guangzhou 510555, Peoples R China

来源：

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY | 2024年 / 19卷

基金：

中国国家自然科学基金;

关键词：

Adaptation models; Face recognition; Training; Histograms; Data models; Feature extraction; Faces; Vision transformer (ViT); adapter; histogram; face anti-spoofing; face presentation attack detection; domain generalization; PRESENTATION ATTACK DETECTION; ADAPTATION;

D O I：

10.1109/TIFS.2024.3420699

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Face Anti-Spoofing (FAS) aims to detect malicious attempts to invade a face recognition system by presenting spoofed faces. State-of-the-art FAS techniques predominantly rely on deep learning models but their cross-domain generalization capabilities are often hindered by the domain shift problem, which arises due to different distributions between training and testing data. In this study, we develop a generalized FAS method under the Efficient Parameter Transfer Learning (EPTL) paradigm, where we adapt the pre-trained Vision Transformer models for the FAS task. During training, the adapter modules are inserted into the pre-trained ViT model, and the adapters are updated while other pre-trained parameters remain fixed. We find the limitations of previous vanilla adapters in that they are based on linear layers, which lack a spoofing-aware inductive bias and thus restrict the cross-domain generalization. To address this limitation and achieve cross-domain generalized FAS, we propose a novel Statistical Adapter (S-Adapter) that gathers local discriminative and statistical information from localized token histograms. To further improve the generalization of the statistical tokens, we propose a novel Token Style Regularization (TSR), which aims to reduce domain style variance by regularizing Gram matrices extracted from tokens across different domains. Our experimental results demonstrate that our proposed S-Adapter and TSR provide significant benefits in both zero-shot and few-shot cross-domain testing, outperforming state-of-the-art methods on several benchmark tests. We will release the source code upon acceptance.

引用

页码：8385 / 8397

页数：13

共 50 条

[41] Learning Inter and Intra Class Variation With Deep Frequency Factorization Network for Face Anti-Spoofing
Liu, Weihua
Li, Qiuyu
Luo, Yiming
Pan, Yushan
Ding, Weiping
Wang, Hao
[J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
[42] On the Effectiveness of Vision Transformers for Zero-shot Face Anti-Spoofing
George, Anjith
Marcel, Sebastien
[J]. 2021 INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2021), 2021,
[43] FGDNet: Fine-Grained Detection Network Towards Face Anti-Spoofing
Qiao, Tong
Wu, Jiasheng
Zheng, Ning
Xu, Ming
Luo, Xiangyang
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7350 - 7363
[44] Selective Domain-Invariant Feature Alignment Network for Face Anti-Spoofing
Zhou, Lifang
Luo, Jun
Gao, Xinbo
Li, Weisheng
Lei, Bangjun
Leng, Jiaxu
[J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 5352 - 5365
[45] Quality-Invariant Domain Generalization for Face Anti-Spoofing
Liu, Yongluo
Li, Zun
Xu, Yaowen
Guo, Zhizhi
Zou, Zhaofan
Wu, Lifang
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (11) : 5239 - 5254
[46] Polarized Image Translation From Nonpolarized Cameras for Multimodal Face Anti-Spoofing
Tian, Yu
Huang, Yalin
Zhang, Kunbo
Liu, Yue
Sun, Zhenan
[J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 5651 - 5664
[47] 3D Face Anti-Spoofing With Dense Squeeze and Excitation Network and Neighborhood-Aware Kernel Adaptation Scheme
Hussein, Mohammed Kareem Hussein
Ucan, Osman Nuri
[J]. IEEE ACCESS, 2025, 13 : 43145 - 43167
[48] Spoofing Attacks and Anti-Spoofing Methods for Face Authentication Over Smartphones
Zheng, Zheng
Wang, Qian
Wang, Cong
[J]. IEEE COMMUNICATIONS MAGAZINE, 2023, 61 (12) : 213 - 219
[49] Dual-Path Adaptive Channel Attention Network Based on Feature Constraints for Face Anti-Spoofing
Li, Nana
Weng, Zhipeng
Liu, Fangmei
Li, Zuhe
Wang, Wei
[J]. IEEE ACCESS, 2025, 13 : 22855 - 22867
[50] Two-Stage Face Detection and Anti-spoofing
Nurnoby, M. Faisal
El-Alfy, El-Sayed M.
[J]. ADVANCES IN VISUAL COMPUTING, ISVC 2023, PT I, 2023, 14361 : 445 - 455

← 1 2 3 4 5 →