Structure-Aware Deep Learning for Product Image Classification

被引:30
作者
Chen, Zhineng [1 ]
Al, Shanshan [2 ]
Jia, Caiyan [2 ]
机构
[1] Chinese Acad Sci, Inst Automat, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
[2] Beijing Jiaotong Univ, 3 Shangyuancun Rd, Beijing 100044, Peoples R China
基金
中国国家自然科学基金;
关键词
Image classification; category hierarchy; convolutional neural network; multi-class regression; multi-task learning; ASSOCIATION;
D O I
10.1145/3231742
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic product image classification is a task of crucial importance with respect to the management of online retailers. Motivated by recent advancements of deep Convolutional Neural Networks (CNN) on image classification, in this work we revisit the problem in the context of product images with the existence of a predefined categorical hierarchy and attributes, aiming to leverage the hierarchy and attributes to improve classification accuracy. With these structure-aware clues, we argue that more advanced deep models could be developed beyond the flat one-versus-all classification performed by conventional CNNs. To this end, novel efforts of this work include a salient-sensitive CNN that gazes into the product foreground by inserting a dedicated spatial attention module; a multiclass regression-based refinement that is expected to predict more accurately by merging prediction scores from multiple preceding CNNs, each corresponding to a distinct classifier in the hierarchy; and a multitask deep learning architecture that effectively explores correlations among categories and attributes for categorical label prediction. Experimental results on nearly 1 million real-world product images basically validate the effectiveness of the proposed efforts individually and jointly, from which performance gains are observed.
引用
收藏
页数:20
相关论文
共 49 条
[21]   Th1 and Th2 Immune Response in Chronic Hepatitis B Patients during a Long-Term Treatment with Adefovir Dipivoxil [J].
Jiang, Yanfang ;
Ma, Zhenhua ;
Xin, Guijie ;
Yan, Hongqing ;
Li, Wanyu ;
Xu, Huining ;
Hao, Chunhai ;
Niu, Junqi ;
Zhao, Pingwei .
MEDIATORS OF INFLAMMATION, 2010, 2010
[22]  
Khosla A., 2011, P CVPR WORKSHOP FINE, V2
[23]   Hierarchical learning of large-margin metrics for large-scale image classification [J].
Lei, Hao ;
Mei, Kuizhi ;
Xin, Jingmin ;
Dong, Peixiang ;
Fan, Jianping .
NEUROCOMPUTING, 2016, 208 :46-58
[24]   Socializing the Semantic Gap: A Comparative Survey on Image Tag Assignment, Refinement, and Retrieval [J].
Li, Xirong ;
Uricchio, Tiberio ;
Ballan, Lamberto ;
Bertini, Marco ;
Snoek, Cees G. M. ;
Del Bimbo, Alberto .
ACM COMPUTING SURVEYS, 2016, 49 (01)
[25]   Visual Tracking With Weighted Adaptive Local Sparse Appearance Model via Spatio-Temporal Context Learning [J].
Li, Zhetao ;
Zhang, Jie ;
Zhang, Kaihua ;
Li, Zhiyong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (09) :4478-4489
[26]  
Lin D, 2015, PROC CVPR IEEE, P1666, DOI 10.1109/CVPR.2015.7298775
[27]   Bilinear CNN Models for Fine-grained Visual Recognition [J].
Lin, Tsung-Yu ;
RoyChowdhury, Aruni ;
Maji, Subhransu .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1449-1457
[28]   DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations [J].
Liu, Ziwei ;
Luo, Ping ;
Qiu, Shi ;
Wang, Xiaogang ;
Tang, Xiaoou .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1096-1104
[29]   Exploratory Product Image Search With Circle-to-Search Interaction [J].
Lu, Shiyang ;
Mei, Tao ;
Wang, Jingdong ;
Zhang, Jian ;
Wang, Zhiyong ;
Li, Shipeng .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (07) :1190-1202
[30]   Zero-Shot Learning via Attribute Regression and Class Prototype Rectification [J].
Luo, Changzhi ;
Li, Zhetao ;
Huang, Kaizhu ;
Feng, Jiashi ;
Wang, Meng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (02) :637-648