Where and What Am I Eating? Image-Based Food Menu Recognition

被引:1
作者
Bolanos, Marc [1 ,2 ]
Valdivia, Marc [1 ]
Radeva, Petia [1 ,2 ]
机构
[1] Univ Barcelona, Barcelona, Spain
[2] Comp Vis Ctr, Bellaterra, Spain
来源
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT VI | 2019年 / 11134卷
关键词
Multimodal learning; Computer vision; Food recognition;
D O I
10.1007/978-3-030-11024-6_45
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Food has become a very important aspect of our social activities. Since social networks and websites like Yelp appeared, their users have started uploading photos of their meals to the Internet. This phenomenon opens a whole world of possibilities for developing models for applying food analysis and recognition on huge amounts of real-world data. A clear application could consist in applying image food recognition by using the menu of the restaurants. Our model, based on Convolutional Neural Networks and Recurrent Neural Networks, is able to learn a language model that generalizes on never seen dish names without the need of re-training it. According to the Ranking Loss metric, the results obtained by the model improve the baseline by a 15%.
引用
收藏
页码:590 / 605
页数:16
相关论文
共 33 条
[1]  
Aguilar E., 2017, ARXIV170904800
[2]   Food Recognition Using Fusion of Classifiers Based on CNNs [J].
Aguilar, Eduardo ;
Bolanos, Marc ;
Radeva, Petia .
IMAGE ANALYSIS AND PROCESSING (ICIAP 2017), PT II, 2017, 10485 :213-224
[3]  
[Anonymous], 2017, ARXIV171105128
[4]  
[Anonymous], 2004, Food and health in Europe: a new basis for action
[5]  
[Anonymous], 2017, P 11 INT WORKSHOP SE
[6]   Leveraging Context to Support Automated Food Recognition in Restaurants [J].
Bettadapura, Vinay ;
Thomaz, Edison ;
Parnami, Aman ;
Abowd, Gregory D. ;
Essa, Irfan .
2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, :580-587
[7]   Food Ingredients Recognition Through Multi-label Learning [J].
Bolanos, Marc ;
Ferra, Aina ;
Radeva, Petia .
NEW TRENDS IN IMAGE ANALYSIS AND PROCESSING - ICIAP 2017, 2017, 10590 :394-402
[8]   VIBIKNet: Visual Bidirectional Kernelized Network for Visual Question Answering [J].
Bolanos, Marc ;
Peris, Alvaro ;
Casacuberta, Francisco ;
Radeva, Petia .
PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2017), 2017, 10255 :372-380
[9]  
Bolaños M, 2016, INT C PATT RECOG, P3140, DOI 10.1109/ICPR.2016.7900117
[10]  
Bossard L, 2014, LECT NOTES COMPUT SC, V8694, P446, DOI 10.1007/978-3-319-10599-4_29