Nonparametric Scene Parsing via Label Transfer

被引：241

作者：

Liu, Ce ^{[1
,2
]}

Yuen, Jenny ^{[2
]}

Torralba, Antonio ^{[2
]}

机构：

[1] Microsoft Res New England, Cambridge, MA 02142 USA

[2] MIT, CSAIL, Cambridge, MA 02139 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2011年 / 33卷 / 12期

基金：

美国国家科学基金会;

关键词：

Object recognition; scene parsing; label transfer; SIFT flow; Markov random fields; OBJECT; TEXTURE;

D O I：

10.1109/TPAMI.2011.131

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

While there has been a lot of recent work on object recognition and image understanding, the focus has been on carefully establishing mathematical models for images, scenes, and objects. In this paper, we propose a novel, nonparametric approach for object recognition and scene parsing using a new technology we name label transfer. For an input image, our system first retrieves its nearest neighbors from a large database containing fully annotated images. Then, the system establishes dense correspondences between the input image and each of the nearest neighbors using the dense SIFT flow algorithm [28], which aligns two images based on local image structures. Finally, based on the dense scene correspondences obtained from SIFT flow, our system warps the existing annotations and integrates multiple cues in a Markov random field framework to segment and recognize the query image. Promising experimental results have been achieved by our nonparametric scene parsing system on challenging databases. Compared to existing object recognition approaches that require training classifiers or appearance models for each object category, our system is easy to implement, has few parameters, and embeds contextual information naturally in the retrieval/alignment procedure.

引用

页码：2368 / 2382

页数：15

共 50 条

[41] Weakly-supervised scene parsing with multiple contextual cues
Li, Teng
Wu, Xinyu
Ni, Bingbing
Lu, Ke
Yan, Shuicheng
INFORMATION SCIENCES, 2015, 323 : 59 - 72
[42] Automatic image annotation via label transfer in the semantic space
Uricchio, Tiberio
Ballan, Lamberto
Seidenari, Lorenzo
Del Bimbo, Alberto
PATTERN RECOGNITION, 2017, 71 : 144 - 157
[43] Semantic combined network for zero-shot scene parsing
Wang, Yinduo
Zhang, Haofeng
Wang, Shidong
Long, Yang
Yang, Longzhi
IET IMAGE PROCESSING, 2020, 14 (04) : 757 - 765
[44] ORDNet: Capturing Omni-Range Dependencies for Scene Parsing
Huang, Shaofei
Liu, Si
Hui, Tianrui
Han, Jizhong
Li, Bo
Feng, Jiashi
Yan, Shuicheng
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8251 - 8263
[45] Non-parametric spatially constrained local prior for scene parsing on real-world data
Zhang, Ligang
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 93
[46] Scene Parsing and Fusion-Based Continuous Traversable Region Formation
Xiao, Xuhong
Ng, Gee Wah
Tan, Yuan Sin
Chuan, Yeo Ye
COMPUTER VISION - ACCV 2014 WORKSHOPS, PT I, 2015, 9008 : 383 - 398
[47] PIG: Prompt Images Guidance for Night-Time Scene Parsing
Xie, Zhifeng
Qiu, Rui
Wang, Sen
Tan, Xin
Xie, Yuan
Ma, Lizhuang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 3921 - 3934
[48] Scene parsing using graph matching on street-view data
Yu, Tianshu
Wang, Ruisheng
COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 145 : 70 - 80
[49] PSANet: Point-wise Spatial Attention Network for Scene Parsing
Zhao, Hengshuang
Zhang, Yi
Liu, Shu
Shi, Jianping
Loy, Chen Change
Lin, Dahua
Jia, Jiaya
COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 270 - 286
[50] Multi-task Learning for Bi-temporal Remote Sensing Scene Parsing via Patch-pixel Representation
Fu, Chenqin
Bao, Tengfei
Lv, Liang
Sirajidin, Salayidin
Fang, Tao
Huo, Hong
ICMLC 2020: 2020 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2018, : 360 - 367

← 1 2 3 4 5 →