Nonparametric Scene Parsing via Label Transfer

被引:241
|
作者
Liu, Ce [1 ,2 ]
Yuen, Jenny [2 ]
Torralba, Antonio [2 ]
机构
[1] Microsoft Res New England, Cambridge, MA 02142 USA
[2] MIT, CSAIL, Cambridge, MA 02139 USA
基金
美国国家科学基金会;
关键词
Object recognition; scene parsing; label transfer; SIFT flow; Markov random fields; OBJECT; TEXTURE;
D O I
10.1109/TPAMI.2011.131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While there has been a lot of recent work on object recognition and image understanding, the focus has been on carefully establishing mathematical models for images, scenes, and objects. In this paper, we propose a novel, nonparametric approach for object recognition and scene parsing using a new technology we name label transfer. For an input image, our system first retrieves its nearest neighbors from a large database containing fully annotated images. Then, the system establishes dense correspondences between the input image and each of the nearest neighbors using the dense SIFT flow algorithm [28], which aligns two images based on local image structures. Finally, based on the dense scene correspondences obtained from SIFT flow, our system warps the existing annotations and integrates multiple cues in a Markov random field framework to segment and recognize the query image. Promising experimental results have been achieved by our nonparametric scene parsing system on challenging databases. Compared to existing object recognition approaches that require training classifiers or appearance models for each object category, our system is easy to implement, has few parameters, and embeds contextual information naturally in the retrieval/alignment procedure.
引用
收藏
页码:2368 / 2382
页数:15
相关论文
共 50 条
  • [41] Weakly-supervised scene parsing with multiple contextual cues
    Li, Teng
    Wu, Xinyu
    Ni, Bingbing
    Lu, Ke
    Yan, Shuicheng
    INFORMATION SCIENCES, 2015, 323 : 59 - 72
  • [42] Automatic image annotation via label transfer in the semantic space
    Uricchio, Tiberio
    Ballan, Lamberto
    Seidenari, Lorenzo
    Del Bimbo, Alberto
    PATTERN RECOGNITION, 2017, 71 : 144 - 157
  • [43] Semantic combined network for zero-shot scene parsing
    Wang, Yinduo
    Zhang, Haofeng
    Wang, Shidong
    Long, Yang
    Yang, Longzhi
    IET IMAGE PROCESSING, 2020, 14 (04) : 757 - 765
  • [44] ORDNet: Capturing Omni-Range Dependencies for Scene Parsing
    Huang, Shaofei
    Liu, Si
    Hui, Tianrui
    Han, Jizhong
    Li, Bo
    Feng, Jiashi
    Yan, Shuicheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8251 - 8263
  • [45] Non-parametric spatially constrained local prior for scene parsing on real-world data
    Zhang, Ligang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 93
  • [46] Scene Parsing and Fusion-Based Continuous Traversable Region Formation
    Xiao, Xuhong
    Ng, Gee Wah
    Tan, Yuan Sin
    Chuan, Yeo Ye
    COMPUTER VISION - ACCV 2014 WORKSHOPS, PT I, 2015, 9008 : 383 - 398
  • [47] PIG: Prompt Images Guidance for Night-Time Scene Parsing
    Xie, Zhifeng
    Qiu, Rui
    Wang, Sen
    Tan, Xin
    Xie, Yuan
    Ma, Lizhuang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 3921 - 3934
  • [48] Scene parsing using graph matching on street-view data
    Yu, Tianshu
    Wang, Ruisheng
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 145 : 70 - 80
  • [49] PSANet: Point-wise Spatial Attention Network for Scene Parsing
    Zhao, Hengshuang
    Zhang, Yi
    Liu, Shu
    Shi, Jianping
    Loy, Chen Change
    Lin, Dahua
    Jia, Jiaya
    COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 270 - 286
  • [50] Multi-task Learning for Bi-temporal Remote Sensing Scene Parsing via Patch-pixel Representation
    Fu, Chenqin
    Bao, Tengfei
    Lv, Liang
    Sirajidin, Salayidin
    Fang, Tao
    Huo, Hong
    ICMLC 2020: 2020 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2018, : 360 - 367