Perceptual Hashing Using Pretrained Vision Transformers

被引:0
|
作者
De Geest, Jelle [1 ]
De Smet, Patrick [2 ]
Bonetto, Lucio [2 ]
Lambert, Peter [1 ]
Van Wallendael, Glenn [1 ]
Mareen, Hannes [1 ]
机构
[1] Univ Ghent, Imec, Dept Elect & Informat Syst, Technol Pk Zwijnaarde 122, B-9052 Ghent, Belgium
[2] Natl Inst Criminalist & Criminol NICC, Vilvoordsesteenweg 100, B-1120 Brussels, Belgium
来源
2024 IEEE GAMING, ENTERTAINMENT, AND MEDIA CONFERENCE, GEM 2024 | 2024年
关键词
Perceptual Hashing; Vision Transformer; Image Forensics;
D O I
10.1109/GEM61861.2024.10585453
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The rapid evolution of digital image circulation has necessitated robust techniques for image identification and comparison, particularly for sensitive applications such as detecting Child Sexual Abuse Material (CSAM) and preventing the spread of harmful content online. Traditional perceptual hashing methods, while useful, fall short when exposed to some common image transformations, or when images are doctored to avoid detection, rendering them ineffective for nuanced comparisons. Addressing this challenge, this paper introduces a novel pretrained vision transformer artificial intelligence (AI) model approach that enhances the robustness and accuracy of perceptual hashing. Leveraging a pretrained Vision Transformer (ViT-L/14), our approach integrates visual and textual data processing to generate feature arrays that represent perceptual image hashes. Through a comprehensive evaluation using a dataset of 50,000 images, we demonstrate that our method offers significant improvements in detecting similarities for certain complex image transformations, aligning more closely with human visual perception than conventional methods. While our method presents certain initial drawbacks such as larger hash sizes and high computational complexity, its ability to better handle perceptual nuances presents a forward step in the realm of image forensics. The potential applications of this research extend to law enforcement, digital media management, and the broader domain of content verification, setting the stage for more secure and efficient digital content analysis.
引用
收藏
页码:19 / 24
页数:6
相关论文
共 50 条
  • [41] A Novel Approach To Lion Re-Identification Using Vision Transformers
    Matlala, Boitumelo
    van der Haar, Dustin
    Vandapalli, Hima
    ARTIFICIAL INTELLIGENCE RESEARCH, SACAIR 2024, 2025, 2326 : 270 - 281
  • [42] Robust perceptual fingerprint image hashing: a comparative study
    Birouk, Wafa
    Lahoulou, Atidel
    Melit, Ali
    Bouridane, Ahmed
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2023, 15 (01) : 59 - 77
  • [43] PERCEPTUAL IMAGE HASHING BASED ON WEIGHTED LLE AND FMT
    Aryan, Jamaluddin
    Wei, Guo
    Abdullahi, Sani M.
    2019 16TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICWAMTIP), 2019, : 311 - 316
  • [44] Artificial Cognition for Early Leaf Disease Detection using Vision Transformers
    Huy-Tan Thai
    Nhu-Y Tran-Van
    Kim-Hung Le
    2021 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC 2021), 2021, : 33 - 38
  • [45] Medicinal Plant Leaf Classification using Deep Learning and Vision Transformers
    Hossain, Shahriar
    Hasan, Rizbanul
    Uddin, Jia
    BAGHDAD SCIENCE JOURNAL, 2025, 22 (03) : 1065 - 1076
  • [46] Contrastive hashing with vision transformer for image retrieval
    Ren, Xiuxiu
    Zheng, Xiangwei
    Zhou, Huiyu
    Liu, Weilong
    Dong, Xiao
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (12) : 12192 - 12211
  • [47] DynaSlim: Dynamic Slimming for Vision Transformers
    Shi, Da
    Gao, Jingsheng
    Liu, Ting
    Fu, Yuzhuo
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1451 - 1456
  • [48] Vision Transformers for Single Image Dehazing
    Song, Yuda
    He, Zhuqing
    Qian, Hui
    Du, Xin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1927 - 1941
  • [49] Vision Transformers for Brain Tumor Classification
    Simon, Eliott
    Briassouli, Alexia
    PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES (BIOIMAGING), VOL 2, 2021, : 123 - 130
  • [50] An Image Perceptual Hashing Algorithm Based on Convolutional Neural Networks
    Yang, Meihong
    Qi, Baolin
    Xian, Yongjin
    Li, Jian
    DIGITAL FORENSICS AND WATERMARKING, IWDW 2023, 2024, 14511 : 95 - 108