Parsing Facades with Shape Grammars and Reinforcement Learning

Cited by: 56
Authors
Teboul, Olivier [1, 2]
Kokkinos, Iasonas [3]
Simon, Loic [4]
Koutsourakis, Panagiotis [5]
Paragios, Nikos [6]
Affiliations
[1] Ecole Cent Paris, MAS Lab, F-92290 Chatenay Malabry, France
[2] Google Inc, BR-30170010 Belo Horizonte, MG, Brazil
[3] Ecole Cent Paris, INRIA Saclay, F-92295 Chatenay Malabry, France
[4] Ecole Natl Super Ingn Caen, CNRS, GREYC UMR 6072, F-14050 Caen, France
[5] Univ Crete, Ecole Cent Paris, F-92295 Chatenay Malabry, France
[6] Ecole Cent Paris, Ecole Ponts, ParisTech, INRIA Saclay, F-92295 Chatenay Malabry, France
Keywords
Image parsing; shape grammar; reinforcement learning; semantic segmentation; data-driven exploration; Markov decision processes; SEGMENTATION
DOI
10.1109/TPAMI.2012.252
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we use shape grammars (SGs) for facade parsing, which amounts to segmenting 2D building facades into balconies, walls, windows, and doors in an architecturally meaningful manner. The main thrust of our work is the introduction of reinforcement learning (RL) techniques to deal with the computational complexity of the problem. RL provides us with techniques such as Q-learning and state aggregation which we exploit to efficiently solve facade parsing. We initially phrase the 1D parsing problem in terms of a Markov Decision Process, paving the way for the application of RL-based tools. We then develop novel techniques for the 2D shape parsing problem that take into account the specificities of the facade parsing problem. Specifically, we use state aggregation to enforce the symmetry of facade floors and demonstrate how to use RL to exploit bottom-up, image-based guidance during optimization. We provide systematic results on the Paris building dataset and obtain state-of-the-art results in a fraction of the time required by previous methods. We validate our method under diverse imaging conditions and make our software and results available online.
Pages: 1744-1756
Page count: 13