IMAGE SEGMENTATION AND RECOGNITION FOR MULTI-CLASS CHINESE FOOD

被引:6
作者
Liang, Yuxiang [1 ]
Li, Jiangfeng [1 ]
Zhao, Qinpei [1 ]
Rao, Weixiong [1 ]
Zhang, Chenxi [1 ]
Wang, Congrong [2 ]
机构
[1] Tongji Univ, Sch Software Engn, Shanghai, Peoples R China
[2] Tongji Univ, Dept Endocrinol & Metab, Shanghai Peoples Hosp 4, Sch Med, Shanghai, Peoples R China
来源
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2022年
基金
上海市自然科学基金;
关键词
Food Image Segmentation; Food Recognition; Multi-Class Chinese Food; Deep Learning; CLASSIFICATION;
D O I
10.1109/ICIP46576.2022.9898001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For multi-class food images, an excellent segmentation method has a great influence on accuracy of recognition result, and hence improve the effectiveness of dietary management for diabetics. For Chinese food images, there are some challenges during the processing, such as blurred outlines, rich colors, and varied appearances due to various cooking methods. To overcome these difficulties, we propose a ChineseFoodSeg approach to obtain accurate and efficient multi-class segmentation. A food recognition model, Two-Path Global Local Network (TPGLNet), is also introduced to jointly learn complementary global and local features of the bounding box and the segment. Experiments on the ChineseDiabetesFood187 dataset collected by us demonstrate that the new methods are competitive compared to existing segmentation and recognition methods.
引用
收藏
页码:3938 / 3942
页数:5
相关论文
共 26 条
[11]   Densely Connected Convolutional Networks [J].
Huang, Gao ;
Liu, Zhuang ;
van der Maaten, Laurens ;
Weinberger, Kilian Q. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269
[12]   Multi-Scale Multi-View Deep Feature Aggregation for Food Recognition [J].
Jiang, Shuqiang ;
Min, Weiqing ;
Liu, Linhu ;
Luo, Zhengdong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :265-276
[13]   Image-Based Food Classification and Volume Estimation for Dietary Assessment: A Review [J].
Lo, Frank Po Wen ;
Sun, Yingnan ;
Qiu, Jianing ;
Lo, Benny .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (07) :1926-1939
[14]   ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design [J].
Ma, Ningning ;
Zhang, Xiangyu ;
Zheng, Hai-Tao ;
Sun, Jian .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :122-138
[15]   ISIA Food-500: A Dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network [J].
Min, Weiqing ;
Liu, Linhu ;
Wang, Zhiling ;
Luo, Zhengdong ;
Wei, Xiaoming ;
Wei, Xiaolin ;
Jiang, Shuqiang .
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, :393-401
[16]   Ingredient-Guided Cascaded Multi-Attention Network for Food Recognition [J].
Min, Weiqing ;
Liu, Linhu ;
Luo, Zhengdong ;
Jiang, Shuqiang .
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, :1331-1339
[17]   Im2Calories: towards an automated mobile vision food diary [J].
Myers, Austin ;
Johnston, Nick ;
Rathod, Vivek ;
Korattikara, Anoop ;
Gorban, Alex ;
Silberman, Nathan ;
Guadarrama, Sergio ;
Papandreou, George ;
Huang, Jonathan ;
Murphy, Kevin .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1233-1241
[18]   Multiresolution gray-scale and rotation invariant texture classification with local binary patterns [J].
Ojala, T ;
Pietikäinen, M ;
Mäenpää, T .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (07) :971-987
[19]   Carbohydrate Estimation by a Mobile Phone-Based System Versus Self-Estimations of Individuals With Type 1 Diabetes Mellitus: A Comparative Study [J].
Rhyner, Daniel ;
Loher, Hannah ;
Dehais, Joachim ;
Anthimopoulos, Marios ;
Shevchik, Sergey ;
Botwey, Ransford Henry ;
Duke, David ;
Stettler, Christoph ;
Diem, Peter ;
Mougiakakou, Stavroula .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2016, 18 (05)
[20]   U-Net: Convolutional Networks for Biomedical Image Segmentation [J].
Ronneberger, Olaf ;
Fischer, Philipp ;
Brox, Thomas .
MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 :234-241