Sharing visual features for multiclass and multiview object detection

被引：336

作者：

Torralba, Antonio

Murphy, Kevin P.

Freeman, William T.

机构：

[1] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA

[2] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1Z4, Canada

[3] Univ British Columbia, Dept Stat, Vancouver, BC V6T 1Z4, Canada

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2007年 / 29卷 / 05期

基金：

美国国家科学基金会;

关键词：

object detection; interclass transfer; sharing features; boosting; multiclass;

D O I：

10.1109/TPAMI.2007.1055

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We consider the problem of detecting a large number of different classes of objects in cluttered scenes. Traditional approaches require applying a battery of different classifiers to the image, at multiple locations and scales. This can be slow and can require a lot of training data since each classifier requires the computation of many different image features. In particular, for independently trained detectors, the ( runtime) computational complexity and the ( training-time) sample complexity scale linearly with the number of classes to be detected. We present a multitask learning procedure, based on boosted decision stumps, that reduces the computational and sample complexity by finding common features that can be shared across the classes ( and/or views). The detectors for each class are trained jointly, rather than independently. For a given performance level, the total number of features required and, therefore, the runtime cost of the classifier, is observed to scale approximately logarithmically with the number of classes. The features selected by joint training are generic edge-like features, whereas the features chosen by training each class separately tend to be more object-specific. The generic features generalize better and considerably reduce the computational cost of multiclass object detection.

引用

页码：854 / 869

页数：16

共 37 条

[1] Learning to detect objects in images via a sparse, part-based representation [J].

Agarwal, S ;

Awan, A ;

Roth, D .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (11) :1475-1490

[2]

Allwein E. L., 2000, J MACHINE LEARNING R, V1, P113, DOI DOI 10.1162/15324430152733133

[3]

AMIT Y, 2003, COMPUTATIONAL STRATE

[4]

[Anonymous], P DAGM 25 PATT REC S

[5]

[Anonymous], P MSRI WORKSH NONL E

[6]

[Anonymous], LABELME DATABASE WEB

[7]

[Anonymous], 2000, P IEEE C COMP VIS PA

[8]

Bart E, 2005, P IEEE C COMP VIS PA

[9]

BERNSTEIN E, 2005, P IEEE C COMP VIS PA

[10] Multitask learning [J].

Caruana, R .

MACHINE LEARNING, 1997, 28 (01) :41-75

← 1 2 3 4 →