S-Adapter: Generalizing Vision Transformer for Face Anti-Spoofing With Statistical Tokens

被引:2
作者
Cai, Rizhao [1 ]
Yu, Zitong [2 ]
Kong, Chenqi [1 ]
Li, Haoliang [3 ]
Chen, Changsheng [4 ,5 ]
Hu, Yongjian [6 ,7 ]
Kot, Alex C. [1 ]
机构
[1] Nanyang Technol Univ, Sch EEE, ROSE Lab, Singapore 639798, Singapore
[2] Great Bay Univ, Sch Comp & Informat Technol, Shantou 523000, Peoples R China
[3] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[4] Shenzhen Univ, Guangdong Key Lab Intelligent Informat Proc, Shenzhen Key Lab Media Secur, State Key Lab Radiofrequency Heterogeneous Integra, Shenzhen 518060, Peoples R China
[5] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China
[6] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 511442, Peoples R China
[7] China Singapore Int Joint Res Inst, Guangzhou 510555, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptation models; Face recognition; Training; Histograms; Data models; Feature extraction; Faces; Vision transformer (ViT); adapter; histogram; face anti-spoofing; face presentation attack detection; domain generalization; PRESENTATION ATTACK DETECTION; ADAPTATION;
D O I
10.1109/TIFS.2024.3420699
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Face Anti-Spoofing (FAS) aims to detect malicious attempts to invade a face recognition system by presenting spoofed faces. State-of-the-art FAS techniques predominantly rely on deep learning models but their cross-domain generalization capabilities are often hindered by the domain shift problem, which arises due to different distributions between training and testing data. In this study, we develop a generalized FAS method under the Efficient Parameter Transfer Learning (EPTL) paradigm, where we adapt the pre-trained Vision Transformer models for the FAS task. During training, the adapter modules are inserted into the pre-trained ViT model, and the adapters are updated while other pre-trained parameters remain fixed. We find the limitations of previous vanilla adapters in that they are based on linear layers, which lack a spoofing-aware inductive bias and thus restrict the cross-domain generalization. To address this limitation and achieve cross-domain generalized FAS, we propose a novel Statistical Adapter (S-Adapter) that gathers local discriminative and statistical information from localized token histograms. To further improve the generalization of the statistical tokens, we propose a novel Token Style Regularization (TSR), which aims to reduce domain style variance by regularizing Gram matrices extracted from tokens across different domains. Our experimental results demonstrate that our proposed S-Adapter and TSR provide significant benefits in both zero-shot and few-shot cross-domain testing, outperforming state-of-the-art methods on several benchmark tests. We will release the source code upon acceptance.
引用
收藏
页码:8385 / 8397
页数:13
相关论文
共 50 条
  • [41] Learning Inter and Intra Class Variation With Deep Frequency Factorization Network for Face Anti-Spoofing
    Liu, Weihua
    Li, Qiuyu
    Luo, Yiming
    Pan, Yushan
    Ding, Weiping
    Wang, Hao
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [42] On the Effectiveness of Vision Transformers for Zero-shot Face Anti-Spoofing
    George, Anjith
    Marcel, Sebastien
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2021), 2021,
  • [43] FGDNet: Fine-Grained Detection Network Towards Face Anti-Spoofing
    Qiao, Tong
    Wu, Jiasheng
    Zheng, Ning
    Xu, Ming
    Luo, Xiangyang
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7350 - 7363
  • [44] Selective Domain-Invariant Feature Alignment Network for Face Anti-Spoofing
    Zhou, Lifang
    Luo, Jun
    Gao, Xinbo
    Li, Weisheng
    Lei, Bangjun
    Leng, Jiaxu
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 5352 - 5365
  • [45] Quality-Invariant Domain Generalization for Face Anti-Spoofing
    Liu, Yongluo
    Li, Zun
    Xu, Yaowen
    Guo, Zhizhi
    Zou, Zhaofan
    Wu, Lifang
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (11) : 5239 - 5254
  • [46] Polarized Image Translation From Nonpolarized Cameras for Multimodal Face Anti-Spoofing
    Tian, Yu
    Huang, Yalin
    Zhang, Kunbo
    Liu, Yue
    Sun, Zhenan
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 5651 - 5664
  • [47] 3D Face Anti-Spoofing With Dense Squeeze and Excitation Network and Neighborhood-Aware Kernel Adaptation Scheme
    Hussein, Mohammed Kareem Hussein
    Ucan, Osman Nuri
    [J]. IEEE ACCESS, 2025, 13 : 43145 - 43167
  • [48] Spoofing Attacks and Anti-Spoofing Methods for Face Authentication Over Smartphones
    Zheng, Zheng
    Wang, Qian
    Wang, Cong
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2023, 61 (12) : 213 - 219
  • [49] Dual-Path Adaptive Channel Attention Network Based on Feature Constraints for Face Anti-Spoofing
    Li, Nana
    Weng, Zhipeng
    Liu, Fangmei
    Li, Zuhe
    Wang, Wei
    [J]. IEEE ACCESS, 2025, 13 : 22855 - 22867
  • [50] Two-Stage Face Detection and Anti-spoofing
    Nurnoby, M. Faisal
    El-Alfy, El-Sayed M.
    [J]. ADVANCES IN VISUAL COMPUTING, ISVC 2023, PT I, 2023, 14361 : 445 - 455