Fuzzy bag of words for social image description

被引:16
作者
Li, Yanshan [1 ,2 ,3 ]
Liu, Weiming [2 ]
Huang, Qinghua [1 ]
Li, Xuelong [4 ]
机构
[1] S China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510640, Peoples R China
[2] S China Univ Technol, Sch Civil Engn & Transportat, Guangzhou 510640, Peoples R China
[3] Shenzhen Univ, Shenzhen 518060, Peoples R China
[4] Chinese Acad Sci, Ctr Opt IMagery Anal & Learning OPTIMAL, State Key Lab Transient Opt & Photon, Xian Inst Opt & Precis Mech, Xian 710119, Peoples R China
关键词
Bag of words; Fuzzy sets theory; Image description; Social images; FEATURES;
D O I
10.1007/s11042-014-2138-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Rapid growth of social media resources brings huge challenges and opportunities for image description technologies. The performance of image description method directly affects the accuracy of image retrieval, image annotation and image recognition. Bag of Words (BoW) as an efficient approach to describing the images has been attracting more and more attention. However, in traditional BoW, the maps between the words in the codebook and the features extracted from the images are actually ambiguous. As the Fuzzy Sets Theory (FST) is a powerful means for dealing with uncertainty efficiently, we utilize the FST to solve the problem caused by the ambiguity between the features and words. Accordingly, we propose a new type of BoW named as FBoW to describe images based on FST. Firstly, the features are extracted from the images. Secondly, k-means is utilized to learn the codebook. Thirdly, a fuzzy membership function is designed to measure the similarity between the features and words. The optimal parameters of the fuzzy membership function are obtained by using a Genetic Algorithm (GA). The histogram is generated by adding up the fuzzy membership values of each word to describe the images. The experimental results show that the proposed FBoW outperforms traditional BoW for social image description.
引用
收藏
页码:1371 / 1390
页数:20
相关论文
共 36 条
[1]  
[Anonymous], P IEEE INT C COMP VI
[2]  
[Anonymous], 2013, P 5 INT C INTERNET M, DOI DOI 10.1145/2499788.2499873
[3]  
[Anonymous], INFORM SCI IN PRESS
[4]  
Banerji S, 2013, LECT NOTES COMPUT SC, V8047, P490, DOI 10.1007/978-3-642-40261-6_59
[5]   The devil is in the details: an evaluation of recent feature encoding methods [J].
Chatfield, Ken ;
Lempitsky, Victor ;
Vedaldi, Andrea ;
Zisserman, Andrew .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
[6]  
Farhangi MM, 2013, ADV INTELL SYST, V177, P681
[7]  
Fei-Fei L, 2005, PROC CVPR IEEE, P524
[8]   A Fast Approach for Integrating ORB Descriptors in the Bag of Words Model [J].
Grana, Costantino ;
Borghesani, Daniele ;
Manfredi, Marco ;
Cucchiara, Rita .
MULTIMEDIA CONTENT AND MOBILE DEVICES, 2013, 8667
[9]   Discovery of time-inconsecutive co-movement patterns of foreign currencies using an evolutionary biclustering method [J].
Huang, Qing-Hua .
APPLIED MATHEMATICS AND COMPUTATION, 2011, 218 (08) :4353-4364
[10]   A robust graph-based segmentation method for breast tumors in ultrasound images [J].
Huang, Qing-Hua ;
Lee, Su-Ying ;
Liu, Long-Zhong ;
Lu, Min-Hua ;
Jin, Lian-Wen ;
Li, An-Hua .
ULTRASONICS, 2012, 52 (02) :266-275