Microsoft COCO: Common Objects in Context

Cited by: 27168
Authors
Lin, Tsung-Yi [1]
Maire, Michael [2]
Belongie, Serge [1]
Hays, James [3]
Perona, Pietro [2]
Ramanan, Deva [4]
Dollar, Piotr [5]
Zitnick, C. Lawrence [5]
Affiliations
[1] Cornell, Ithaca, NY 14850 USA
[2] CALTECH, Pasadena, CA 91125 USA
[3] Brown Univ, Providence, RI 02912 USA
[4] Univ Calif Irvine, Irvine, CA 92717 USA
[5] Microsoft Res, New York, NY USA
Source
Keywords
DOI
10.1007/978-3-319-10602-1_48
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding. This is achieved by gathering images of complex everyday scenes containing common objects in their natural context. Objects are labeled using per-instance segmentations to aid in precise object localization. Our dataset contains photos of 91 object types that would be easily recognizable by a 4-year-old. With a total of 2.5 million labeled instances in 328k images, the creation of our dataset drew upon extensive crowd worker involvement via novel user interfaces for category detection, instance spotting and instance segmentation. We present a detailed statistical analysis of the dataset in comparison to PASCAL, ImageNet, and SUN. Finally, we provide baseline performance analysis for bounding box and segmentation detection results using a Deformable Parts Model.
Pages: 740-755
Number of pages: 16
Related papers
50 in total
  • [31] Common denominator (The exhibition The New Painting of Common Objects)
    Coplans, J
    ARTFORUM, 2003, 41 (05): : 18 - 18
  • [32] Evaluating Microsoft Face API in the context of student classroom attendance
    Marjanovic, M.
    Kramberger, T.
    Kramberger, R.
    Cesar, I
    2020 43RD INTERNATIONAL CONVENTION ON INFORMATION, COMMUNICATION AND ELECTRONIC TECHNOLOGY (MIPRO 2020), 2020, : 182 - 185
  • [33] Visual Objects As Context: Exploiting Visual Objects for Lexical Entailment
    Muraoka, Masayasu
    Nasukawa, Tetsuya
    Bhattacharjee, Bishwaranjan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 2723 - 2735
  • [34] THE CONTEXT OF THE "COMMON WORD"
    Christiansen, Drew S. J.
    REVIEW OF FAITH & INTERNATIONAL AFFAIRS, 2008, 6 (04): : 49 - 52
  • [35] Introducing tangible objects into motion controlled gameplay using Microsoft® Kinect TM
    Bozgeyikli, Gamze
    Bozgeyikli, Evren
    Isler, Veysi
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2013, 24 (3-4) : 429 - 441
  • [36] Handling mathematical objects: representations and context
    Jessica Carter
    Synthese, 2013, 190 : 3983 - 3999
  • [37] PUTTING PROPERTIES OF OBJECTS IN CONTEXT - REPLY
    CRIST, WB
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 1981, 110 (03) : 303 - 305
  • [38] RECIPE CONTEXT NULL OBJECTS IN ENGLISH
    MASSAM, D
    ROBERGE, Y
    LINGUISTIC INQUIRY, 1989, 20 (01) : 134 - 139
  • [39] Context Improves Comprehension of Fronted Objects
    Line Burholt Kristensen
    Elisabeth Engberg-Pedersen
    Mads Poulsen
    Journal of Psycholinguistic Research, 2014, 43 : 125 - 140
  • [40] Objects as context for detecting their semantic parts
    Gonzalez-Garcia, Abel
    Modolo, Davide
    Ferrari, Vittorio
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6907 - 6916