Efficient Object Embedding for Spliced Image Retrieval

被引:13
作者
Chen, Bor-Chun [1 ,2 ]
Wu, Zuxuan [2 ,3 ]
Davis, Larry S. [1 ]
Lim, Ser-Nam [2 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Facebook AI, Menlo Pk, CA 94025 USA
[3] Fudan Univ, Shanghai, Peoples R China
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
LOCALIZATION;
D O I
10.1109/CVPR46437.2021.01472
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting spliced images is one of the emerging challenges in computer vision. Unlike prior methods that focus on detecting low-level artifacts generated during the manipulation process, we use an image retrieval approach to tackle this problem. When given a spliced query image, our goal is to retrieve the original image from a database of authentic images. To achieve this goal, we propose representing an image by its constituent objects based on the intuition that the finest granularity of manipulations is often-times at the object-level. We introduce a framework, object embeddings for spliced image retrieval (OE-SIR), that utilizes modern object detectors to localize object regions. Each region is then embedded and collectively used to represent the image. Further, we propose a student-teacher training paradigm for learning discriminative embeddings within object regions to avoid expensive multiple forward passes. Detailed analysis of the efficacy of different feature embedding models is also provided in this study. Extensive experimental results show that the OE-SIR achieves state-of-the-art performance in spliced image retrieval.
引用
收藏
页码:14960 / 14970
页数:11
相关论文
共 74 条
[51]   CNN Image Retrieval Learns from BoW: Unsupervised Fine-Tuning with Hard Examples [J].
Radenovic, Filip ;
Tolias, Giorgos ;
Chum, Ondrej .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :3-20
[52]  
Razavian A. S., 2016, ITE Transactions on Media Technology and Applications, V4, P251
[53]  
REDMON J, 2016, PROC CVPR IEEE, P779, DOI DOI 10.1109/CVPR.2016.91
[54]   End-to-end weakly-supervised semantic alignment [J].
Rocco, Ignacio ;
Arandjelovic, Relja ;
Sivic, Josef .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6917-6925
[55]   Convolutional neural network architecture for geometric matching [J].
Rocco, Ignacio ;
Arandjelovic, Relja ;
Sivic, Josef .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :39-48
[56]  
Romero A., 2014, ARXIV14126550
[57]   Image retrieval: Current techniques, promising directions, and open issues [J].
Rui, Y ;
Huang, TS ;
Chang, SF .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 1999, 10 (01) :39-62
[58]   Image Splicing Localization using a Multi-task Fully Convolutional Network (MFCN) [J].
Salloum, Ronald ;
Ren, Yuzhuo ;
Kuo, C. -C. Jay .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 51 :201-209
[59]   Video Google: A text retrieval approach to object matching in videos [J].
Sivic, J ;
Zisserman, A .
NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, 2003, :1470-+
[60]   Deep Metric Learning via Lifted Structured Feature Embedding [J].
Song, Hyun Oh ;
Xiang, Yu ;
Jegelka, Stefanie ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4004-4012