Comparison of convolutional neural network models for food image classification

Cited by: 14

Authors:

Yigit, Gozde Ozsert [1]

Ozyildirim, B. Melis [2]

Affiliations:

[1] Gaziantep Univ, Comp Engn Dept, Gaziantep, Turkey

[2] Cukurova Univ, Comp Engn Dept, Adana, Turkey

Source:

JOURNAL OF INFORMATION AND TELECOMMUNICATION | 2018 / Vol. 2 / No. 03

Keywords:

Deep learning; convolutional neural network; food classification; transfer learning

DOI:

10.1080/24751839.2018.1446236

Chinese Library Classification number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

According to World Health Organization estimates, more than 1.9 billion adults (39% of adults) were overweight in 2014, and about 13% of the world's adult population was obese. The worldwide prevalence of obesity more than doubled between 1980 and 2014. Mobile applications that record people's food intake have become popular. With an improved food classification system, users can photograph their meals and the system classifies the photos into food categories. To this end, we proposed a deep convolutional neural network structure trained from scratch and compared its performance with the pre-trained structures AlexNet and CaffeNet at INISTA 2017. This study is an extended version of that work. Three different deep convolutional neural networks were trained from scratch using different learning methods (stochastic gradient descent, Nesterov's accelerated gradient, and Adaptive Moment Estimation) and compared with AlexNet and CaffeNet fine-tuned with the same learning algorithms. Training, validation, and test datasets were generated from the Food11 and Food101 datasets. All tests were run through the NVIDIA DIGITS interface on a GeForce GTX1070. According to the test results, the pre-trained models provided better results than the proposed structures, but the performances were comparable. Moreover, the learning optimization methods accelerated training and improved the performance of all the compared models.
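For readers unfamiliar with the three learning methods named in the abstract, the following is a minimal sketch of their update rules applied to a one-dimensional toy loss. It is not the authors' code or configuration: the loss function, the learning rates, the step count, and the function names (`grad`, `sgd`, `nesterov`, `adam`) are all assumptions chosen only for illustration, and the study itself trained convolutional networks rather than a scalar parameter.

```python
import math


def grad(w):
    # Gradient of the assumed toy loss f(w) = (w - 3)^2, minimized at w = 3.
    return 2.0 * (w - 3.0)


def sgd(w, steps=500, lr=0.1):
    # Plain gradient descent: step against the gradient.
    for _ in range(steps):
        w -= lr * grad(w)
    return w


def nesterov(w, steps=500, lr=0.1, momentum=0.9):
    # Nesterov's accelerated gradient: the gradient is evaluated at the
    # look-ahead point w + momentum * v instead of at w itself.
    v = 0.0
    for _ in range(steps):
        g = grad(w + momentum * v)
        v = momentum * v - lr * g
        w += v
    return w


def adam(w, steps=500, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8):
    # Adaptive Moment Estimation: running averages of the gradient (m) and
    # of its square (v), each bias-corrected, scale every step.
    m = 0.0
    v = 0.0
    for t in range(1, steps + 1):
        g = grad(w)
        m = beta1 * m + (1.0 - beta1) * g
        v = beta2 * v + (1.0 - beta2) * g * g
        m_hat = m / (1.0 - beta1 ** t)
        v_hat = v / (1.0 - beta2 ** t)
        w -= lr * m_hat / (math.sqrt(v_hat) + eps)
    return w


if __name__ == "__main__":
    for name, optimizer in [("SGD", sgd), ("Nesterov", nesterov), ("Adam", adam)]:
        print(name, optimizer(0.0))
```

All three functions start from the same initial value and should approach the minimizer of the toy loss; how quickly each one gets there depends on the assumed hyperparameters and says nothing about the results reported in the paper.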

References

Pages: 347-357

Number of pages: 11

23 entries in total

