Benchmarking algorithms for food localization and semantic segmentation

被引:25
作者
Aslan, Sinem [1 ,2 ,3 ]
Ciocca, Gianluigi [4 ]
Mazzini, Davide [4 ]
Schettini, Raimondo [4 ]
机构
[1] Ca Foscari Univ Venice, ECLT, Dorsoduro 3911,Calle Crosera, I-30123 Venice, Italy
[2] Ca Foscari Univ Venice, DAIS, Via Torino 155, I-30172 Venice, VE, Italy
[3] Ege Univ, Int Comp Inst, TR-35100 Izmir, Bornova, Turkey
[4] Univ Milano Bicocca, Dept Informat Syst & Commun, Viale Sarca 336, I-20126 Milan, Italy
关键词
Benchmarking; Convolutional neural network; Food localization; Food segmentation; RECOGNITION SYSTEM; IMAGE RECOGNITION; CLASSIFICATION; RETRIEVAL; FEATURES;
D O I
10.1007/s13042-020-01153-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of food segmentation is quite challenging since food is characterized by intrinsic high intra-class variability. Also, segmentation of food images taken in-the-wild may be characterized by acquisition artifacts, and that could be problematic for the segmentation algorithms. A proper evaluating of segmentation algorithms is of paramount importance for the design and improvement of food analysis systems that can work in less-than-ideal real scenarios. In this paper, we evaluate the performance of different deep learning-based segmentation algorithms in the context of food. Due to the lack of large-scale food segmentation datasets, we initially create a new dataset composed of 5000 images of 50 diverse food categories. The images are accurately annotated with pixel-wise annotations. In order to test the algorithms under different conditions, the dataset is augmented with the same images but rendered under different acquisition distortions that comprise illuminant change, JPEG compression, Gaussian noise, and Gaussian blur. The final dataset is composed of 120,000 images. Using standard benchmark measures, we conducted extensive experiments to evaluate ten state-of-the-art segmentation algorithms on two tasks: food localization and semantic food segmentation.
引用
收藏
页码:2827 / 2847
页数:21
相关论文
共 78 条
[1]   Regularized uncertainty-based multi-task learning model for food analysis [J].
Aguilar, Eduardo ;
Bolanos, Marc ;
Radeva, Petia .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 60 :360-370
[2]   Grab, Pay, and Eat: Semantic Food Detection for Smart Restaurants [J].
Aguilar, Eduardo ;
Remeseiro, Bealriz ;
Bolanos, Marc ;
Radeva, Petia .
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (12) :3266-3275
[3]  
[Anonymous], 2014, Comput. Sci.
[4]  
[Anonymous], 2014, INT C LEARN REPR ICL
[5]  
[Anonymous], 2018, ARXIV180804702
[6]  
[Anonymous], 2018, ARXIV180306815
[7]  
[Anonymous], 2016, ENET DEEP NEURAL NET
[8]  
[Anonymous], 2015, 2015 IEEE INT C MULT
[9]  
[Anonymous], 2019, ABS190307437 ARXIV C
[10]  
[Anonymous], 2013, Bioinfor-matics and Bioengineering (BIBE), 13th International Conference on