What Makes Paris Look like Paris?

被引：431

作者：

Doersch, Carl ^{[1
]}

Singh, Saurabh ^{[1
]}

Gupta, Abhinav ^{[1
]}

Sivic, Josef ^{[2
]}

Efros, Alexei A. ^{[1
,2
]}

机构：

[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

[2] Ecole Normale Super, INRIA, F-75231 Paris, France

来源：

ACM TRANSACTIONS ON GRAPHICS | 2012年 / 31卷 / 04期

关键词：

data mining; visual summarization; reference art; big data; computational geography; visual perception; IMAGE; RECOGNITION; SELECTION; FEATURES; SCENE;

D O I：

10.1145/2185520.2185597

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Given a large repository of geotagged imagery, we seek to automatically find visual elements, e. g. windows, balconies, and street signs, that are most distinctive for a certain geo-spatial area, for example the city of Paris. This is a tremendously difficult task as the visual features distinguishing architectural elements of different places can be very subtle. In addition, we face a hard search problem: given all possible patches in all images, which of them are both frequently occurring and geographically informative? To address these issues, we propose to use a discriminative clustering approach able to take into account the weak geographic supervision. We show that geographically representative image elements can be discovered automatically from Google Street View imagery in a discriminative manner. We demonstrate that these elements are visually interpretable and perceptually geo-informative. The discovered visual elements can also support a variety of computational geography tasks, such as mapping architectural correspondences and influences within and across cities, finding representative elements at different geo-spatial scales, and geographically-informed image retrieval.

引用

页数：9

共 35 条

[1]

[Anonymous], 2 INT VIS WORKSH C C

[2]

[Anonymous], 2009, P 18 INT C WORLD WID

[3]

[Anonymous], ARXIV12053137CSCV

[4]

[Anonymous], 2006, CVPR

[5]

[Anonymous], ART RATATOUILLE

[6]

[Anonymous], CTUCMP201116

[7]

[Anonymous], ADV NEURAL INFORM PR

[8] Poselets: Body Part Detectors Trained Using 3D Human Pose Annotations [J].

Bourdev, Lubomir ;

Malik, Jitendra .

2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :1365-1372

[9]

Chen DM, 2011, PROC CVPR IEEE, P737, DOI 10.1109/CVPR.2011.5995610

[10]

Chum O, 2009, PROC CVPR IEEE, P17, DOI 10.1109/CVPRW.2009.5206531

← 1 2 3 4 →