Fine-grained pornographic image recognition with multiple feature fusion transfer learning

被引:18
作者
Lin, Xinnan [1 ]
Qin, Feiwei [1 ]
Peng, Yong [1 ]
Shao, Yanli [1 ]
机构
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Pornographic image recognition; Image classification; Multiple feature fusion; Transfer learning; INTERNET PORNOGRAPHY; NEURAL-NETWORKS; IMPACT;
D O I
10.1007/s13042-020-01157-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image has become a main medium of Internet information dissemination, makes it easy for an Internet visitor to get pornographic images with just few clicks on websites. It is necessary to build pornographic image recognition systems since uncontrolled spreading of adult content could be harm to the adolescents. Previous solutions for pornographic image recognition are usually based on hand-crafted features like human skin color. Hand-crafted feature based methods are straightforward to understand and use but limited in specific situations. In this paper, we propose a deep learning based approach with multiple feature fusion transfer learning strategy. Firstly, we obtain the training data from an open data set called NSFW with 120,000+ images. Images would be classified into different levels according to its content sensitivity. Then we employ data augment methods, train a deep convolutional neural network to extract image features and conduct the classification job, without the need for hand-crafted rules. A pre-trained model is used to initialize the network and help extract the basic features. Furthermore, we propose a fusion method that makes use of multiple transfer learning models in inference, to improve the accuracy on the test set. The experimental results prove that our method achieves high accuracy on the pornographic image recognition and inspection task.
引用
收藏
页码:73 / 86
页数:14
相关论文
共 40 条
[1]  
AGASTYA IMA, 2018, 2018 4 INT C ADV COM, P1, DOI DOI 10.1109/ICACCAF.2018.8776843
[2]   Pooling in image representation: The visual codeword point of view [J].
Avila, Sandra ;
Thome, Nicolas ;
Cord, Matthieu ;
Valle, Eduardo ;
Araujo, Arnaldo de A. .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (05) :453-465
[3]  
Blanchard N, 2018, FIRST GRAND CHALLENGE AND WORKSHOP ON HUMAN MULTIMODAL LANGUAGE (CHALLENGE-HML), P1
[4]  
Boski M, 2017, 2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS)
[5]  
Caetano C, 2014, EUR SIGNAL PR CONF, P1681
[6]   A Survey on Deep Transfer Learning [J].
Tan, Chuanqi ;
Sun, Fuchun ;
Kong, Tao ;
Zhang, Wenchang ;
Yang, Chao ;
Liu, Chunfang .
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 :270-279
[7]  
Deselaers T, 2008, PROC CVPR IEEE, P3017
[8]  
Glorot X., 2011, P 14 INT C ART INT S
[9]  
He K., 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), DOI [DOI 10.1109/CVPR.2016.90, 10.1109/CVPR.2016.90]
[10]   Bag of Tricks for Image Classification with Convolutional Neural Networks [J].
He, Tong ;
Zhang, Zhi ;
Zhang, Hang ;
Zhang, Zhongyue ;
Xie, Junyuan ;
Li, Mu .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :558-567