A 4D Light-Field Dataset and CNN Architectures for Material Recognition

被引:137
作者
Wang, Ting-Chun [1 ]
Zhu, Jun-Yan [1 ]
Hiroaki, Ebi [2 ]
Chandraker, Manmohan [2 ]
Efros, Alexei A. [1 ]
Ramamoorthi, Ravi [2 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Univ Calif San Diego, San Diego, CA 92103 USA
来源
COMPUTER VISION - ECCV 2016, PT III | 2016年 / 9907卷
关键词
Light-field; Material recognition; Convolutional neural network; TEXTURE;
D O I
10.1007/978-3-319-46487-9_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a new light-field dataset of materials, and take advantage of the recent success of deep learning to perform material recognition on the 4D light-field. Our dataset contains 12 material categories, each with 100 images taken with a Lytro Illum, from which we extract about 30,000 patches in total. To the best of our knowledge, this is the first mid-size dataset for light-field images. Our main goal is to investigate whether the additional information in a light-field (such as multiple sub-aperture views and view-dependent reflectance effects) can aid material recognition. Since recognition networks have not been trained on 4D images before, we propose and compare several novel CNN architectures to train on light-field images. In our experiments, the best performing CNN architecture achieves a 7% boost compared with 2D image classification (70% -> 77%). These results constitute important baselines that can spur further research in the use of CNNs for light-field applications. Upon publication, our dataset also enables other novel applications of light-fields, including object detection, image segmentation and view interpolation.
引用
收藏
页码:121 / 138
页数:18
相关论文
共 38 条
  • [31] Long J., 2015, P IEEE C COMP VIS PA, P3431, DOI DOI 10.1109/CVPR.2015.7298965
  • [32] Marwah K., 2013, ACM Transactions on Graphics (TOG), V32, P1
  • [33] Pairwise Rotation Invariant Co-Occurrence Local Binary Pattern
    Qi, Xianbiao
    Xiao, Rong
    Li, Chun-Guang
    Qiao, Yu
    Guo, Jun
    Tang, Xiaoou
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (11) : 2199 - 2213
  • [34] Exploring the Usefulness of Light Field Cameras for Biometrics: An Empirical Study on Face and Iris Recognition
    Raghavendra, R.
    Raja, Kiran Bylappa
    Busch, Christoph
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2016, 11 (05) : 922 - 936
  • [35] Simonyan K., 2014, 14091556 ARXIV, DOI DOI 10.1016/J.INFSOF.2008.09.005
  • [36] Szegedy C., 2015, PROC IEEE C COMPUT V
  • [37] Wanner S., 2013, VMV, V13, P225
  • [38] Weinmann M, 2014, LECT NOTES COMPUT SC, V8691, P156, DOI 10.1007/978-3-319-10578-9_11