Image-Based Food Calorie Estimation Using Recipe Information

被引:25
作者
Ege, Takumi [1 ]
Yanai, Keiji [1 ]
机构
[1] Univ Electrocommun, Dept Informat, Chofu, Tokyo 1828585, Japan
关键词
food image recognition; image-based food calorie estimation; convolutional neural network; multi-task CNN;
D O I
10.1587/transinf.2017MVP0027
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, mobile applications for recording everyday meals draw much attention for self dietary. However, most of the applications return food calorie values simply associated with the estimated food categories, or need for users to indicate the rough amount of foods manually. In fact, it has not been achieved to estimate food calorie from a food photo with practical accuracy, and it remains an unsolved problem. Then, in this paper, we propose estimating food calorie from a food photo by simultaneous learning of food calories, categories, ingredients and cooking directions using deep learning. Since there exists a strong correlation between food calories and food categories, ingredients and cooking directions information in general, we expect that simultaneous training of them brings performance boosting compared to independent single training. To this end, we use a multi-task CNN. In addition, in this research, we construct two kinds of datasets that is a dataset of calorie-annotated recipe collected from Japanese recipe sites on the Web and a dataset collected from an American recipe site. In the experiments, we trained both multi-task and single-task CNNs, and compared them. As a result, a multi-task CNN achieved the better performance on both food category estimation and food calorie estimation than single-task CNNs. For the Japanese recipe dataset, by introducing a multi-task CNN, 0.039 were improved on the correlation coefficient, while for the American recipe dataset, 0.090 were raised compared to the result by the single-task CNN. In addition, we showed that the proposed multi-task CNN based method outperformed search-based methods proposed before.
引用
收藏
页码:1333 / 1341
页数:9
相关论文
共 22 条
[1]   Multi-Task CNN Model for Attribute Prediction [J].
Abdulnabi, Abrar H. ;
Wang, Gang ;
Lu, Jiwen ;
Jia, Kui .
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (11) :1949-1959
[2]  
[Anonymous], P INT C LEARN REPR
[3]  
[Anonymous], 2015, P INT C MACH LEARN
[4]  
[Anonymous], 2015, PROC LEARNINGSYS
[5]  
[Anonymous], P WORKSH ACM MULT TH
[6]  
[Anonymous], 2013, AD VANCES NEURAL INF
[7]   Leveraging Context to Support Automated Food Recognition in Restaurants [J].
Bettadapura, Vinay ;
Thomaz, Edison ;
Parnami, Aman ;
Abowd, Gregory D. ;
Essa, Irfan .
2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, :580-587
[8]  
Bossard L, 2014, LECT NOTES COMPUT SC, V8694, P446, DOI 10.1007/978-3-319-10599-4_29
[9]   Deep-based Ingredient Recognition for Cooking Recipe Retrieval [J].
Chen, Jingjing ;
Ngo, Chong-Wah .
MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, :32-41
[10]  
Chen Mei-Yun, 2012, SIGGRAPH Asia 2012 Technical Briefs., P29