An approach of bag-of-words based on visual attention model for pornographic images recognition in compressed domain

被引:33
作者
Zhang, Jing [1 ]
Sui, Lei [1 ]
Zhuo, Li [1 ]
Li, Zhenwei [1 ]
Yang, Yuncong [1 ]
机构
[1] Beijing Univ Technol, Signal & Informat Proc Lab, Beijing 100124, Peoples R China
基金
中国国家自然科学基金;
关键词
Pornographic images recognition; Bag-of-words; Visual attention model; Compressed domain; Pornographic region; SEARCH;
D O I
10.1016/j.neucom.2012.11.029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bag-of-words (BoW) model has been widely used in pornographic images recognition and filtering. Most of existing methods create BoW from images with a scale-invariant feature transform (SIFT) descriptor in the pixel domain. These methods require extra processing time to decompress images in compressed formats. In addition, the SIFT descriptor only views local feature points in centers of some regions as BoW, which ignores a major role of image region in the human visual system. Different from the above methods in this paper, a BoW approach based on the visual attention model is proposed to recognize pornographic images in compressed domain, which includes the following steps: (1) face is detected to remove the face or ID photo from some benign images; (2) a visual attention model is built according to the characteristics of pornographic image; (3) pornographic regions are detected by visual attention model in compressed domain; (4) four features of color, texture, intensity and skin are extracted in pornographic regions; (5) BoW is created by k-means cluster and (6) BoW will be used to represent and recognize pornographic images. Experimental results show that proposed BoW approach based on the visual attention model can more accurately recognize pornographic images with less computational time. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:145 / 152
页数:8
相关论文
共 34 条
[1]  
Chang C. C., LIBSVM PRACTICAL GUI
[2]   A compressed domain scheme for classifying block edge patterns [J].
Chang, HS ;
Kang, K .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2005, 14 (02) :145-151
[3]  
Deselaers T, 2008, INT C PATT RECOG, P2100
[4]  
Fleck M. M., 1996, Computer Vision - ECCV '96. 4th Eurpean Conference on Computer Proceedings, P593
[5]   Identifying nude pictures [J].
Forsyth, DA ;
Fleck, MM .
THIRD IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION - WACV '96, PROCEEDINGS, 1996, :103-108
[6]  
Gang Zhao, 2008, 2008 Second International Conference on Future Generation Communication and Networking Symposia (FGCNS), P107, DOI 10.1109/FGCNS.2008.41
[7]   Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search [J].
Gao, Yue ;
Wang, Meng ;
Zha, Zheng-Jun ;
Shen, Jialie ;
Li, Xuelong ;
Wu, Xindong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (01) :363-376
[8]   3-D Object Retrieval and Recognition With Hypergraph Analysis [J].
Gao, Yue ;
Wang, Meng ;
Tao, Dacheng ;
Ji, Rongrong ;
Dai, Qionghai .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (09) :4290-4303
[9]   Less is More: Efficient 3-D Object Retrieval With Query View Selection [J].
Gao, Yue ;
Wang, Meng ;
Zha, Zheng-Jun ;
Tian, Qi ;
Dai, Qionghai ;
Zhang, Naiyao .
IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (05) :1007-1018
[10]   A saliency-based search mechanism for overt and covert shifts of visual attention [J].
Itti, L ;
Koch, C .
VISION RESEARCH, 2000, 40 (10-12) :1489-1506