Effective near-duplicate image detection using perceptual hashing and deep learning

Cited by: 0

Authors:

Jakhar, Yash [1]

Borah, Malaya Dutta [1]

Affiliations:

[1] Natl Inst Technol, Dept Comp Sci & Engn, Silchar, India

Source:

INFORMATION PROCESSING & MANAGEMENT | 2025 / Vol. 62 / Issue 04

Keywords:

Near-duplicate images; Neural network; Generative Adversarial Network; Perceptual hashing; Siamese network; Vision Transformer;

DOI:

10.1016/j.ipm.2025.104086

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Near-duplicate image detection is a long-standing problem in computer vision. Previous approaches to detecting near duplicates highlighted the need to explore image transformations more thoroughly in order to handle complex images effectively. We propose a method for finding near-duplicate images that integrates three techniques: perceptual hashing, a Siamese network, and a Vision Transformer. Perceptual hashing provides a quick way to filter out similar-looking pictures, while the Siamese network architecture paired with the Vision Transformer identifies more complex near-duplicate instances. The integrated approach learns a metric space from data that reflects both visual similarity and perceptual closeness among items in the dataset. The results demonstrate the effectiveness and robustness of the proposed method, which achieves an AUROC of 0.99 and a precision of 0.987 on the California-ND dataset, and an AUROC of 0.92 with a precision of 0.884 on the INRIA Holidays dataset, outperforming traditional methods by over 10% in both metrics. This represents a significant step forward in near-duplicate image detection research.
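
The hash-based pre-filtering stage described in the abstract can be illustrated with a minimal sketch. The abstract does not say which perceptual hash the authors use, so the example below substitutes a simple average hash as a stand-in. The function names (`average_hash`, `hamming_distance`, `is_candidate_pair`), the 8 x 8 hash size, and the distance threshold of 10 are all hypothetical choices for illustration, not values from the paper. Images are assumed to be grayscale and given as nested lists of pixel intensities.

```python
from typing import Sequence


def average_hash(pixels: Sequence[Sequence[float]], hash_size: int = 8) -> int:
    """Return a hash_size*hash_size-bit average hash of a grayscale image.

    Assumption: this is a simple stand-in for the (unspecified) perceptual
    hash in the paper. The image is reduced to a hash_size x hash_size grid
    by block averaging, and each grid cell contributes one bit: 1 if it is
    brighter than the mean of all cells, else 0.
    """
    height, width = len(pixels), len(pixels[0])
    if height < hash_size or width < hash_size:
        raise ValueError("image must be at least hash_size pixels per side")

    cells = []
    for r in range(hash_size):
        r0, r1 = r * height // hash_size, (r + 1) * height // hash_size
        for c in range(hash_size):
            c0, c1 = c * width // hash_size, (c + 1) * width // hash_size
            block = [pixels[y][x] for y in range(r0, r1) for x in range(c0, c1)]
            cells.append(sum(block) / len(block))

    mean = sum(cells) / len(cells)
    bits = 0
    for value in cells:
        bits = (bits << 1) | (1 if value > mean else 0)
    return bits


def hamming_distance(hash_a: int, hash_b: int) -> int:
    """Count the bit positions in which the two hashes differ."""
    return bin(hash_a ^ hash_b).count("1")


def is_candidate_pair(hash_a: int, hash_b: int, max_distance: int = 10) -> bool:
    """Cheap pre-filter: keep a pair only if the hashes are close.

    Assumption: max_distance=10 is an illustrative threshold, not a value
    taken from the paper.
    """
    return hamming_distance(hash_a, hash_b) <= max_distance
```

In a pipeline of the kind the abstract outlines, pairs that pass such a filter would then be handed to the learned Siamese / Vision Transformer model for a finer similarity decision, while pairs with distant hashes could be discarded without running the network.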

Pages: 12

