Effective near-duplicate image detection using perceptual hashing and deep learning

被引:0
作者
Jakhar, Yash [1 ]
Borah, Malaya Dutta [1 ]
机构
[1] Natl Inst Technol, Dept Comp Sci & Engn, Silchar, India
关键词
Near-duplicate images; Neural network; Generative Adversarial Network; Perceptual hashing; Siamese network; Vision Transformer;
D O I
10.1016/j.ipm.2025.104086
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Computer vision has always been concerned with near-duplicate image detection. Previous approaches for detecting near duplicates highlighted the necessity to adequately explore the aspect of image transformations for effectively handling complex images. We proposed a method of finding near duplicate images using the integration of three different techniques: perceptual hashing, Siamese network, and Vision Transformer. Perceptual hashing gives us a quick way to filter out similar-looking pictures, while the Siamese network architecture paired with the Vision transformer helps us identify more complex near duplicate instances. The integrated approach learns a metric space from data, which reflects both visual similarity and perceptual closeness among items in the dataset. The results demonstrate the effectiveness and robustness of our proposed method, achieving an AUROC of 0.99 and a precision of 0.987 on the California- ND dataset, and an AUROC of 0.92 with a precision of 0.884 on the INRIA Holidays dataset, significantly outperforming traditional methods by over 10% in both metrics. This represents a significant step forward in near-duplicate image detection research.
引用
收藏
页数:12
相关论文
共 38 条
  • [11] A Twofold Siamese Network for Real-Time Object Tracking
    He, Anfeng
    Luo, Chong
    Tian, Xinmei
    Zeng, Wenjun
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4834 - 4843
  • [12] Deep Constrained Siamese Hash Coding Network and Load-Balanced Locality-Sensitive Hashing for Near Duplicate Image Detection
    Hu, Weiming
    Fan, Yabo
    Xing, Junliang
    Sun, Liang
    Cai, Zhaoquan
    Maybank, Stephen
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (09) : 4452 - 4464
  • [13] Jegou H, 2008, LECT NOTES COMPUT SC, V5302, P304, DOI 10.1007/978-3-540-88682-2_24
  • [14] Jinda-Apiraksa A, 2013, INT WORK QUAL MULTIM, P142, DOI 10.1109/QoMEX.2013.6603227
  • [15] Ke Yan, 2004, ACM Multimedia, P5
  • [16] Evaluating the Performance of ResNet Model Based on Image Recognition
    Khan, Riaz Ullah
    Zhang, Xiaosong
    Kumar, Rajesh
    Aboagye, Emelia Opoku
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON COMPUTING AND ARTIFICIAL INTELLIGENCE (ICCAI 2018), 2018, : 86 - 90
  • [17] Revisiting Gist-PCA Hashing for Near Duplicate Image Detection
    Kim, Hyunwoo
    Sohn, SungRyull
    Kim, Junmo
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2019, 91 (06): : 575 - 586
  • [18] Kumar Akshi, 2020, Procedia Computer Science, V173, P8
  • [19] Enhancing Security of Medical Images Using Deep Learning, Chaotic Map, and Hash Table
    Kumar, Piyush
    Rahman, Mobashshirur
    Namasudra, Suyel
    Moparthi, Nageswara Rao
    [J]. MOBILE NETWORKS & APPLICATIONS, 2023, 29 (5) : 1489 - 1503
  • [20] MS-RMAC: Multiscale Regional Maximum Activation of Convolutions for Image Retrieval
    Li, Yang
    Xu, Yulong
    Wang, Jiabao
    Miao, Zhuang
    Zhang, Yafei
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (05) : 609 - 613