Toward Retail Product Recognition on Grocery Shelves

被引:19
作者
Varol, Gul [1 ,3 ]
Kuzu, Ridvan Salih [2 ,3 ]
机构
[1] Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
[2] Bogazici Univ, Dept Elect & Elect Engn, TR-34342 Istanbul, Turkey
[3] Idea Teknol, TR-34398 Istanbul, Turkey
来源
SIXTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2014) | 2015年 / 9443卷
关键词
Object recognition; object detection; retail product; grocery image; bag of words;
D O I
10.1117/12.2179127
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
This paper addresses the problem of retail product recognition on grocery shelf images. We present a technique for accomplishing this task with a low time complexity. We decompose the problem into detection and recognition. The former is achieved by a generic product detection module which is trained on a specific class of products (e.g. tobacco packages). Cascade object detection framework of Viola and Jones [1] is used for this purpose. We further make use of Support Vector Machines (SVMs) to recognize the brand inside each detected region. We extract both shape and color information; and apply feature-level fusion from two separate descriptors computed with the bag of words approach. Furthermore, we introduce a dataset (available on request) that we have collected for similar research purposes. Results are presented on this dataset of more than 5,000 images consisting of 10 tobacco brands. We show that satisfactory detection and classification can be achieved on devices with cheap computational power. Potential applications of the proposed approach include planogram compliance control, inventory management and assisting visually impaired people during shopping.
引用
收藏
页数:7
相关论文
共 12 条
[1]  
[Anonymous], ICCV
[2]  
[Anonymous], 2005, ICCV
[3]  
Auclair A, 2008, LECT NOTES COMPUT SC, V4918, P224, DOI 10.1007/978-3-540-79860-6_18
[4]  
Dorko G., 2003, ICCV
[5]  
Kleban J., 2008, IEEE INT C MULT EXP
[6]   Distinctive image features from scale-invariant keypoints [J].
Lowe, DG .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) :91-110
[7]  
Nilsback M.E., 2006, CVPR
[8]  
Opalach A., 2012, U.S. Patent, Patent No. [8,189,855, 8189855]
[9]  
Quelhas P., 2005, ICCV
[10]  
Varol G., 2014, IEEE SIGN PROC COMM