Food Recognition Model Based on Deep Learning and Attention Mechanism

Cited by: 1
Authors
He, Lili [1 ,2 ]
Cai, Zhiwei [1 ,2 ]
Ouyang, Dantong [1 ,2 ]
Bai, Hongtao [1 ,2 ]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun, Peoples R China
[2] Jilin Univ, Symbol Computat & Knowledge Engn, Minist Educ, Changchun, Peoples R China
Source
2022 8TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS, BIGCOM | 2022
Funding
National Natural Science Foundation of China;
Keywords
Deep learning; Attention mechanism; Multi-task learning; Food recognition; Calorie estimation;
DOI
10.1109/BigCom57025.2022.00048
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
As food culture and Internet technology have developed, sharing food photos online has become popular. Mining the useful information contained in these food images remains a challenge. Image-based food recognition technology has broad application prospects. It can not only quickly identify food categories, ingredients and cooking methods, providing people with relevant recipe information, but also predict food nutrition information, which can be used in nutritional analysis, scientific dietary matching and medical health management. To address these needs, in this paper we conduct research and analysis from two aspects: dataset construction and recognition model design. The main contributions of this paper are as follows: (1) Since no public dataset contains both food cooking methods and calorie information, we construct a food dataset with rich food attributes. (2) Existing food calorie prediction methods usually require multiple calculation steps and ignore the influence of cooking methods. In addition, the mutual occlusion of ingredients, the changes in shape, color and texture of ingredients under different cooking methods, and the similarity of different types of food in shape and color all make food image recognition difficult. To solve these problems, a food recognition model based on a multi-task attention network is proposed.
Pages: 331-341
Page count: 11
Related papers
47 entries in total
[1]  
Acouplecooks, 2014, US
[2]  
Allrecipes, 2015, US
[3]  
[Anonymous], 2012, SIGGRAPH Asia 2012 Technical Briefs, DOI 10.1145/2407746.
[4]  
Bakingmischief, 2017, US
[5]  
Bossard L, 2014, LECT NOTES COMPUT SC, V8694, P446, DOI 10.1007/978-3-319-10599-4_29
[6]   Cross-modal Recipe Retrieval with Rich Food Attributes [J].
Chen, Jing-Jing ;
Ngo, Chong-Wah ;
Chua, Tat-Seng .
PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, :1771-1779
[7]   Deep-based Ingredient Recognition for Cooking Recipe Retrieval [J].
Chen, Jingjing ;
Ngo, Chong-Wah .
MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, :32-41
[8]  
Chen X, 2017, Arxiv, DOI arXiv:1705.02743
[9]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[10]  
Dehais J, 2016, MADIMA'16: PROCEEDINGS OF THE 2ND INTERNATIONAL WORKSHOP ON MULTIMEDIA ASSISTED DIETARY MANAGEMENT, P91