Perceptual Hashing Using Pretrained Vision Transformers

被引:0
|
作者
De Geest, Jelle [1 ]
De Smet, Patrick [2 ]
Bonetto, Lucio [2 ]
Lambert, Peter [1 ]
Van Wallendael, Glenn [1 ]
Mareen, Hannes [1 ]
机构
[1] Univ Ghent, Imec, Dept Elect & Informat Syst, Technol Pk Zwijnaarde 122, B-9052 Ghent, Belgium
[2] Natl Inst Criminalist & Criminol NICC, Vilvoordsesteenweg 100, B-1120 Brussels, Belgium
来源
2024 IEEE GAMING, ENTERTAINMENT, AND MEDIA CONFERENCE, GEM 2024 | 2024年
关键词
Perceptual Hashing; Vision Transformer; Image Forensics;
D O I
10.1109/GEM61861.2024.10585453
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The rapid evolution of digital image circulation has necessitated robust techniques for image identification and comparison, particularly for sensitive applications such as detecting Child Sexual Abuse Material (CSAM) and preventing the spread of harmful content online. Traditional perceptual hashing methods, while useful, fall short when exposed to some common image transformations, or when images are doctored to avoid detection, rendering them ineffective for nuanced comparisons. Addressing this challenge, this paper introduces a novel pretrained vision transformer artificial intelligence (AI) model approach that enhances the robustness and accuracy of perceptual hashing. Leveraging a pretrained Vision Transformer (ViT-L/14), our approach integrates visual and textual data processing to generate feature arrays that represent perceptual image hashes. Through a comprehensive evaluation using a dataset of 50,000 images, we demonstrate that our method offers significant improvements in detecting similarities for certain complex image transformations, aligning more closely with human visual perception than conventional methods. While our method presents certain initial drawbacks such as larger hash sizes and high computational complexity, its ability to better handle perceptual nuances presents a forward step in the realm of image forensics. The potential applications of this research extend to law enforcement, digital media management, and the broader domain of content verification, setting the stage for more secure and efficient digital content analysis.
引用
收藏
页码:19 / 24
页数:6
相关论文
共 50 条
  • [21] Efficient and Robust Perceptual Hashing Using Log-Polar Image Representation
    Plesca, Cezar
    Morogan, Luciana
    2014 10TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), 2014,
  • [22] A perceptual hashing method based on luminance features
    Luo, Siqing
    PIAGENG 2010: PHOTONICS AND IMAGING FOR AGRICULTURAL ENGINEERING, 2010, 7752
  • [23] Perceptual Hashing of Cyclostationary Signal with Sparse Coding
    Liu, Haining
    Huang, Yixiang
    Liu, Chengliang
    Zhang, Jinkai
    2018 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (ICPHM), 2018,
  • [24] PERCEPTUAL HASHING FOR CONTENT BASED IMAGE RETRIEVAL
    Meenalochini, M.
    Saranya, K.
    Rajkumar, G. V.
    Mahto, Akash
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION AND ELECTRONICS SYSTEMS (ICCES 2018), 2018, : 235 - 238
  • [25] PHASER: Perceptual hashing algorithms evaluation and results - An open source forensic framework
    McKeown, Sean
    Aaby, Peter
    Steyven, Andreas
    FORENSIC SCIENCE INTERNATIONAL-DIGITAL INVESTIGATION, 2024, 48
  • [26] Image forgery classification and localization through vision transformers
    Pawar, Digambar
    Gowda, Raghavendra
    Chandra, Krishna
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2025, 14 (01)
  • [27] Vision Transformers with Hierarchical Attention
    Liu, Yun
    Wu, Yu-Huan
    Sun, Guolei
    Zhang, Le
    Chhatkuli, Ajad
    Van Gool, Luc
    MACHINE INTELLIGENCE RESEARCH, 2024, 21 (04) : 670 - 683
  • [28] Constituent Attention for Vision Transformers
    Li, Haoling
    Xue, Mengqi
    Song, Jie
    Zhang, Haofei
    Huang, Wenqi
    Liang, Lingyu
    Song, Mingli
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 237
  • [29] Distance distributions and runtime analysis of perceptual hashing algorithms
    Sharma, Shivdutt
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
  • [30] A New Methodology for Condition Monitoring Based on Perceptual Hashing
    Liu, Haining
    Men, Xiuhua
    Li, Fajia
    Zhang, Jinkai
    Wang, Xiaohong
    Liu, Chengliang
    PROCEEDINGS OF THE 2018 13TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2018), 2018, : 919 - 923