Deep-based Ingredient Recognition for Cooking Recipe Retrieval

被引:265
作者
Chen, Jingjing [1 ]
Ngo, Chong-Wah [1 ]
机构
[1] City Univ Hong Kong, Kowloon, Hong Kong, Peoples R China
来源
MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE | 2016年
关键词
Food categorization; ingredient recognition; zero-shot retrieval; multi-task deep learning; FOOD RECOGNITION;
D O I
10.1145/2964284.2964315
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Retrieving recipes corresponding to given dish pictures facilitates the estimation of nutrition facts, which is crucial to various health relevant applications. The current approaches mostly focus on recognition of food category based on global dish appearance without explicit analysis of ingredient composition. Such approaches are incapable for retrieval of recipes with unknown food categories, a problem referred to as zero-shot retrieval. On the other hand, content-based retrieval without knowledge of food categories is also difficult to attain satisfactory performance due to large visual variations in food appearance and ingredient composition. As the number of ingredients is far less than food categories, understanding ingredients underlying dishes in principle is more scalable than recognizing every food category and thus is suitable for zero-shot retrieval. Nevertheless, ingredient recognition is a task far harder than food categorization, and this seriously challenges the feasibility of relying on them for retrieval. This paper proposes deep architectures for simultaneous learning of ingredient recognition and food categorization, by exploiting the mutual but also fuzzy relationship between them. The learnt deep features and semantic labels of ingredients are then innovatively applied for zero-shot retrieval of recipes. By experimenting on a large Chinese food dataset with images of highly complex dish appearance, this paper demonstrates the feasibility of ingredient recognition and sheds light on this zero-shot problem peculiar to cooking recipe retrieval.
引用
收藏
页码:32 / 41
页数:10
相关论文
共 37 条
[1]   FoodLog: Multimedia Tool for Healthcare Applications [J].
Aizawa, Kiyoharu ;
Ogawa, Makoto .
IEEE MULTIMEDIA, 2015, 22 (02) :4-8
[2]  
[Anonymous], 2014, Computer Science
[3]  
[Anonymous], THE AMERICAN JOURNAL
[4]  
[Anonymous], 2015, MATH PROB ENG, DOI DOI 10.1016/J.CMET.2015.09.010
[5]  
[Anonymous], RECOGNITION VOLUME E
[6]  
[Anonymous], COMPUTER VISION PATT
[7]  
[Anonymous], 2013, NeurIPS
[8]  
[Anonymous], COMP VIS PATT REC WO
[9]  
[Anonymous], J BIOMEDICAL HLTH IN
[10]  
[Anonymous], 2012, PROC SIGGRAPH ASIA T