Human's Scene Sketch Understanding

被引:9
作者
Ye, Yuxiang [1 ]
Lu, Yijuan [1 ]
Jiang, Hao [2 ]
机构
[1] Texas State Univ, San Marcos, TX 78666 USA
[2] Boston Coll, Chestnut Hill, MA 02167 USA
来源
ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL | 2016年
关键词
Sketch understanding; scene sketch; deep learning;
D O I
10.1145/2911996.2912067
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Human's sketch understanding is important. It has many applications in human computer interaction, multimedia, and computer vision. Recognizing human sketches is also challenging. Previous methods focus on single-object sketch recognition. Understanding human's scene sketch that involves multiple objects and their complex interactions has not been explored. In this paper, we tackle this new problem. We create the first scene sketch dataset "Scene250" and propose a deep learning method to understand human scene sketches. We propose "Scene-Net", a new deep convolutional neural network (CNN) structure, based on which we build a novel scene sketch recognition system. Our system has been tested on the collected scene sketch dataset and compared with other state-of-the-art CNNs and sketch recognition approaches. Our experimental results demonstrate that our method achieves the state of art.
引用
收藏
页码:355 / 358
页数:4
相关论文
共 9 条
  • [1] [Anonymous], INT C INT US INT
  • [2] [Anonymous], AFIPS
  • [3] [Anonymous], 2015, ARXIV150107873
  • [4] [Anonymous], 2012, SIGGRAPH
  • [5] [Anonymous], SIGGRAPH
  • [6] Geoffrey EHinton., 2012, Improving neural networks by preventing co-adaptation of feature detectors
  • [7] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
  • [8] LeCun Y., 2002, Efficient backprop. Neural Networks: Tricks of the Trade, P9
  • [9] Geometric matching for clip-art drawing retrieval
    Sousa, Pedro
    Fonseca, Manuel J.
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2009, 20 (02) : 71 - 83