What Makes Paris Look like Paris?

被引:415
作者
Doersch, Carl [1 ]
Singh, Saurabh [1 ]
Gupta, Abhinav [1 ]
Sivic, Josef [2 ]
Efros, Alexei A. [1 ,2 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Ecole Normale Super, INRIA, F-75231 Paris, France
来源
ACM TRANSACTIONS ON GRAPHICS | 2012年 / 31卷 / 04期
关键词
data mining; visual summarization; reference art; big data; computational geography; visual perception; IMAGE; RECOGNITION; SELECTION; FEATURES; SCENE;
D O I
10.1145/2185520.2185597
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Given a large repository of geotagged imagery, we seek to automatically find visual elements, e. g. windows, balconies, and street signs, that are most distinctive for a certain geo-spatial area, for example the city of Paris. This is a tremendously difficult task as the visual features distinguishing architectural elements of different places can be very subtle. In addition, we face a hard search problem: given all possible patches in all images, which of them are both frequently occurring and geographically informative? To address these issues, we propose to use a discriminative clustering approach able to take into account the weak geographic supervision. We show that geographically representative image elements can be discovered automatically from Google Street View imagery in a discriminative manner. We demonstrate that these elements are visually interpretable and perceptually geo-informative. The discovered visual elements can also support a variety of computational geography tasks, such as mapping architectural correspondences and influences within and across cities, finding representative elements at different geo-spatial scales, and geographically-informed image retrieval.
引用
收藏
页数:9
相关论文
共 35 条
[1]  
[Anonymous], 2 INT VIS WORKSH C C
[2]  
[Anonymous], 2009, P 18 INT C WORLD WID
[3]  
[Anonymous], ARXIV12053137CSCV
[4]  
[Anonymous], 2006, CVPR
[5]  
[Anonymous], ART RATATOUILLE
[6]  
[Anonymous], CTUCMP201116
[7]  
[Anonymous], ADV NEURAL INFORM PR
[8]   Poselets: Body Part Detectors Trained Using 3D Human Pose Annotations [J].
Bourdev, Lubomir ;
Malik, Jitendra .
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :1365-1372
[9]  
Chen DM, 2011, PROC CVPR IEEE, P737, DOI 10.1109/CVPR.2011.5995610
[10]  
Chum O, 2009, PROC CVPR IEEE, P17, DOI 10.1109/CVPRW.2009.5206531