ToonNet: A cartoon image dataset and a DNN-based semantic classification system

被引:2
作者
Zhou, Yanqing [1 ]
Jin, Yongxu [1 ]
Luo, Anqi [1 ]
Chan, Szeyu [1 ]
Xiao, Xiangyun [1 ]
Yang, Xubo [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
来源
PROCEEDINGS OF THE 16TH ACM SIGGRAPH INTERNATIONAL CONFERENCE ON VIRTUAL-REALITY CONTINUUM AND ITS APPLICATIONS IN INDUSTRY (VRCAI 2018) | 2018年
基金
中国国家自然科学基金;
关键词
Image Dataset; Cartoon Image recognition; Machine Learning;
D O I
10.1145/3284398.3284403
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Cartoon-style pictures can be seen almost everywhere in our daily life. Numerous applications try to deal with cartoon pictures, a dataset of cartoon pictures will be valuable for these applications. In this paper, we first present ToonNet: a cartoon-style image recognition dataset. We construct our benchmark set by 4000 images in 12 different classes collected from the Internet with little manual filtration. We extend the basal dataset to 10000 images by adopting several methods, including snapshots of rendered 3D models with a cartoon shader, a 2D-3D-2D converting procedure using a cartoon-modeling method and a hand-drawing stylization filter. Then, we describe how to build an effective neural network for image semantic classification based on ToonNet. We present three techniques for building the Deep Neural Network (DNN), namely, IUS: Inputs Unified Stylization, stylizing the inputs to reduce the complexity of hand-drawn cartoon images; FIN: Feature Inserted Network, inserting intuitionistic and valuable global features into the network; NPN: Network Plus Network, using multiple single networks as a new mixed network. We show the efficacy and generality of our network strategies in our experiments. By utilizing these techniques, the classification accuracy can reach 78% (top-1) and 93%(top-3), which has an improvement of about 5% (top-1) compared with classical DNNs.
引用
收藏
页数:8
相关论文
共 32 条
  • [1] [Anonymous], 2016, P IEEE C COMPUTER VI
  • [2] [Anonymous], PROC CVPR IEEE
  • [3] [Anonymous], 2012, Technical Report
  • [4] [Anonymous], 2010, INT J COMPUT VISION, DOI DOI 10.1007/s11263-009-0275-4
  • [5] [Anonymous], ABS170306868 CORR
  • [6] [Anonymous], ABS160507678 CORR
  • [7] [Anonymous], P INT C MACH LEARN L
  • [8] [Anonymous], ABS17070687 CORR
  • [9] Iterative Region Merging and Object Retrieval Method Using Mean shift segmentation and Flood fill algorihtm
    Bhargava, Neeraj
    Trivedi, Prakriti
    Toshniwal, Akanksha
    Swarnkar, Himanshu
    [J]. 2013 THIRD INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATIONS (ICACC 2013), 2013, : 157 - 160
  • [10] Bolukbasi T, 2017, PR MACH LEARN RES, V70