Nonparametric Scene Parsing via Label Transfer

Cited by: 241
Authors
Liu, Ce [1, 2]
Yuen, Jenny [2]
Torralba, Antonio [2]
Affiliations
[1] Microsoft Research New England, Cambridge, MA 02142 USA
[2] MIT, CSAIL, Cambridge, MA 02139 USA
Funding
National Science Foundation (United States)
Keywords
Object recognition; scene parsing; label transfer; SIFT flow; Markov random fields; OBJECT; TEXTURE;
DOI
10.1109/TPAMI.2011.131
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
While there has been a lot of recent work on object recognition and image understanding, the focus has been on carefully establishing mathematical models for images, scenes, and objects. In this paper, we propose a novel, nonparametric approach for object recognition and scene parsing using a new technology we name label transfer. For an input image, our system first retrieves its nearest neighbors from a large database containing fully annotated images. Then, the system establishes dense correspondences between the input image and each of the nearest neighbors using the dense SIFT flow algorithm [28], which aligns two images based on local image structures. Finally, based on the dense scene correspondences obtained from SIFT flow, our system warps the existing annotations and integrates multiple cues in a Markov random field framework to segment and recognize the query image. Promising experimental results have been achieved by our nonparametric scene parsing system on challenging databases. Compared to existing object recognition approaches that require training classifiers or appearance models for each object category, our system is easy to implement, has few parameters, and embeds contextual information naturally in the retrieval/alignment procedure.
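The retrieve, align, and transfer steps described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation. Every name in it (`nearest_neighbors`, `warp_labels`, `transfer_labels`, `zero_flow`, and the `descriptor`/`labels` fields) is hypothetical, the dense alignment is replaced by a stand-in that returns a zero flow field instead of SIFT flow, and the Markov random field integration is replaced by a plain per-pixel majority vote.

```python
from collections import Counter
from math import dist


def nearest_neighbors(query_descriptor, database, k):
    """Return the k annotated entries closest to the query in descriptor space."""
    ranked = sorted(database, key=lambda entry: dist(query_descriptor, entry["descriptor"]))
    return ranked[:k]


def warp_labels(labels, flow):
    """Pull a neighbor's labels onto the query grid.

    flow[y][x] = (dx, dy) means query pixel (x, y) matches neighbor pixel
    (x + dx, y + dy); out-of-range targets are clamped to the border.
    """
    max_y, max_x = len(labels) - 1, len(labels[0]) - 1
    warped = []
    for y, flow_row in enumerate(flow):
        row = []
        for x, (dx, dy) in enumerate(flow_row):
            src_x = min(max(x + dx, 0), max_x)
            src_y = min(max(y + dy, 0), max_y)
            row.append(labels[src_y][src_x])
        warped.append(row)
    return warped


def transfer_labels(query_descriptor, database, estimate_flow, k=3):
    """Retrieve k neighbors, warp their annotations, and vote per pixel.

    Assumption: majority voting stands in for the MRF described in the abstract.
    """
    neighbors = nearest_neighbors(query_descriptor, database, k)
    warped_maps = [warp_labels(n["labels"], estimate_flow(n)) for n in neighbors]
    height, width = len(warped_maps[0]), len(warped_maps[0][0])
    return [
        [Counter(m[y][x] for m in warped_maps).most_common(1)[0][0] for x in range(width)]
        for y in range(height)
    ]


def zero_flow(entry):
    """Stand-in for dense alignment: assume the neighbor is already aligned."""
    return [[(0, 0) for _ in row] for row in entry["labels"]]


# Toy database of 2x2 annotated "images" with 2-D global descriptors.
database = [
    {"descriptor": [0.0, 0.0], "labels": [["sky", "sky"], ["road", "road"]]},
    {"descriptor": [0.1, 0.0], "labels": [["sky", "sky"], ["road", "car"]]},
    {"descriptor": [0.0, 0.1], "labels": [["sky", "tree"], ["road", "road"]]},
    {"descriptor": [5.0, 5.0], "labels": [["sea", "sea"], ["sand", "sand"]]},
]

parsed = transfer_labels([0.0, 0.0], database, zero_flow, k=3)
print(parsed)
```

In this toy run the three street-like entries are retrieved and the distant beach-like entry is ignored, so the labels that appear in only one neighbor ("car", "tree") are outvoted. A real system would pass an `estimate_flow` that compares the query image with each neighbor, and would weigh the warped labels against other cues rather than counting votes.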
Pages: 2368-2382
Number of pages: 15
Related papers
50 in total
  • [1] Nonparametric scene parsing in the images of buildings
    Talebi, Mehdi
    Vafaei, Abbas
    Monadjemi, S. Amirhassan
    COMPUTERS & ELECTRICAL ENGINEERING, 2018, 70 : 777 - 788
  • [2] Non-parametric scene parsing: Label transfer methods and datasets
    Bhowmick, Alexy
    Saharia, Sarat
    Hazarika, Shyamanta M.
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 219
  • [3] Partial similarity based nonparametric scene parsing in certain environment
    Zhang, Honghui
    Fang, Tian
    Chen, Xiaowu
    Zhao, Qinping
    Quan, Long
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011
  • [4] Nonparametric scene parsing with deep convolutional features and dense alignment
    Ma, Chih-Hao
    Hsu, Chiou-Ting
    Huet, Benoit
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 1915 - 1919
  • [5] Parametric and nonparametric context models: A unified approach to scene parsing
    Aliniya, Parvaneh
    Razzaghi, Parvin
    PATTERN RECOGNITION, 2018, 84 : 165 - 181
  • [6] Nonparametric scene parsing with adaptive feature relevance and semantic context
    Singh, Gautam
    Kosecka, Jana
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 3151 - 3157
  • [7] Boosting Scene Parsing Performance via Reliable Scale Prediction
    Shi, Hengcan
    Li, Hongliang
    Wu, Qingbo
    Meng, Fanman
    Ngan, King N.
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 492 - 500
  • [8] Fusion of 3D-LIDAR and camera data for scene parsing
    Zhao, Gangqiang
    Xiao, Xuhong
    Yuan, Junsong
    Ng, Gee Wah
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2014, 25 (01) : 165 - 183
  • [9] Spatially Constrained Location Prior for Scene Parsing
    Zhang, Ligang
    Verma, Brijesh
    Stockwell, David
    Chowdhury, Sujan
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 1480 - 1486
  • [10] Scene Parsing From an MAP Perspective
    Li, Xuelong
    Mou, Lichao
    Lu, Xiaoqiang
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (09) : 1876 - 1886