Efficient Object Embedding for Spliced Image Retrieval

被引：12

作者：

Chen, Bor-Chun ^{[1
,2
]}

Wu, Zuxuan ^{[2
,3
]}

Davis, Larry S. ^{[1
]}

Lim, Ser-Nam ^{[2
]}

机构：

[1] Univ Maryland, College Pk, MD 20742 USA

[2] Facebook AI, Menlo Pk, CA 94025 USA

[3] Fudan Univ, Shanghai, Peoples R China

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

关键词：

LOCALIZATION;

D O I：

10.1109/CVPR46437.2021.01472

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Detecting spliced images is one of the emerging challenges in computer vision. Unlike prior methods that focus on detecting low-level artifacts generated during the manipulation process, we use an image retrieval approach to tackle this problem. When given a spliced query image, our goal is to retrieve the original image from a database of authentic images. To achieve this goal, we propose representing an image by its constituent objects based on the intuition that the finest granularity of manipulations is often-times at the object-level. We introduce a framework, object embeddings for spliced image retrieval (OE-SIR), that utilizes modern object detectors to localize object regions. Each region is then embedded and collectively used to represent the image. Further, we propose a student-teacher training paradigm for learning discriminative embeddings within object regions to avoid expensive multiple forward passes. Detailed analysis of the efficacy of different feature embedding models is also provided in this study. Extensive experimental results show that the OE-SIR achieves state-of-the-art performance in spliced image retrieval.

引用

页码：14960 / 14970

页数：11

共 74 条

[1]

[Anonymous], 2005, PROC CVPR IEEE

[2]

[Anonymous], 2014, IEEE COMPUT SOC CONF, DOI [DOI 10.1109/CVPRW.2014.131, 10.1109/cvprw.2014.131]

[3]

[Anonymous], 2018, ADV NEURAL INFORM PR

[4]

Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]

[5] Factors of Transferability for a Generic ConvNet Representation [J].

Azizpour, Hossein ;

Razavian, Ali Sharif ;

Sullivan, Josephine ;

Maki, Atsuto ;

Carlsson, Stefan .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (09) :1790-1802

[6]

Ba LJ., 2013, ADV NEURAL INFORM PR, V3, P2654, DOI DOI 10.5555/2969033.2969123

[7] Neural Codes for Image Retrieval [J].

Babenko, Artem ;

Slesarev, Anton ;

Chigorin, Alexandr ;

Lempitsky, Victor .

COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 :584-599

[8] Exploiting Spatial Structure for Localizing Manipulated Image Regions [J].

Bappy, Jawadul H. ;

Roy-Chowdhury, Amit K. ;

Bunk, Jason ;

Nataraj, Lakshmanan ;

Manjunath, B. S. .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4980-4989

[9]

Bucilua C., 2006, P 12 ACM SIGKDD INT, P535, DOI [DOI 10.1145/1150402.1150464, 10.1145/1150402.1150464]

[10] Toward Realistic Image Compositing with Adversarial Learning [J].

Chen, Bor-Chun ;

Kae, Andrew .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :8407-8416

← 1 2 3 4 5 6 7 8 →