Effective near-duplicate image detection using perceptual hashing and deep learning

Cited by: 0
Authors
Jakhar, Yash [1]
Borah, Malaya Dutta [1]
Affiliations
[1] Natl Inst Technol, Dept Comp Sci & Engn, Silchar, India
Keywords
Near-duplicate images; Neural network; Generative Adversarial Network; Perceptual hashing; Siamese network; Vision Transformer
DOI
10.1016/j.ipm.2025.104086
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Near-duplicate image detection is a long-standing problem in computer vision. Previous approaches highlighted the need to explore image transformations more thoroughly in order to handle complex images effectively. We propose a method for finding near-duplicate images that integrates three techniques: perceptual hashing, a Siamese network, and a Vision Transformer. Perceptual hashing provides a quick way to filter out similar-looking pictures, while the Siamese network architecture paired with the Vision Transformer identifies more complex near-duplicate instances. The integrated approach learns a metric space from data that reflects both visual similarity and perceptual closeness among items in the dataset. The results demonstrate the effectiveness and robustness of the proposed method, which achieves an AUROC of 0.99 and a precision of 0.987 on the California-ND dataset, and an AUROC of 0.92 with a precision of 0.884 on the INRIA Holidays dataset, significantly outperforming traditional methods by over 10% in both metrics. This represents a significant step forward in near-duplicate image detection research.
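The first stage described in the abstract, a fast perceptual-hash filter, can be illustrated with a minimal sketch. The abstract does not say which hash function or threshold the authors use, so the code below substitutes a simple difference hash (dHash) over block-averaged brightness. The function names `dhash`, `hamming`, and `is_candidate_pair`, the 8x8 hash size, and the 10-bit threshold are all assumptions made for illustration. It is not the paper's implementation.

```python
import numpy as np


def dhash(gray: np.ndarray, hash_size: int = 8) -> np.ndarray:
    """Difference hash of a 2-D grayscale array, as a flat boolean vector.

    Assumes the image has at least hash_size rows and hash_size + 1 columns.
    """
    h, w = gray.shape
    # Block-average the image down to hash_size rows by (hash_size + 1) columns.
    rows = np.array_split(np.arange(h), hash_size)
    cols = np.array_split(np.arange(w), hash_size + 1)
    small = np.array([[gray[np.ix_(r, c)].mean() for c in cols] for r in rows])
    # Each bit records whether brightness increases from a cell to its right neighbour.
    return (small[:, 1:] > small[:, :-1]).ravel()


def hamming(a: np.ndarray, b: np.ndarray) -> int:
    """Number of differing bits between two hashes of equal length."""
    return int(np.count_nonzero(a != b))


def is_candidate_pair(img_a: np.ndarray, img_b: np.ndarray, max_bits: int = 10) -> bool:
    """Cheap pre-filter: keep a pair only if the hashes differ in few bits.

    max_bits is an illustrative threshold, not a value from the paper.
    """
    return hamming(dhash(img_a), dhash(img_b)) <= max_bits
```

In a two-stage pipeline of the kind the abstract outlines, pairs that pass a cheap filter like this would then be scored by the learned model (here, the Siamese network with a Vision Transformer backbone). Because each bit depends only on the ordering of neighbouring block means, a uniform brightness or contrast change leaves the hash unchanged, which is what makes it useful as a coarse near-duplicate filter.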
Pages: 12