SINGLE-VIEW FOOD PORTION ESTIMATION: LEARNING IMAGE-TO-ENERGY MAPPINGS USING GENERATIVE ADVERSARIAL NETWORKS

被引:0
作者
Fang, Shaobo [1 ]
Shao, Zeman [1 ]
Mao, Runyu [1 ]
Fu, Chichen [1 ]
Kerr, Deborah A. [2 ]
Boushey, Carol J. [3 ]
Delp, Edward J. [1 ]
Zhu, Fengqing [1 ]
机构
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
[2] Curtin Univ, Sch Publ Hlth, Perth, WA, Australia
[3] Univ Hawaii, Canc Ctr, Canc Epidemiol Program, Honolulu, HI 96822 USA
来源
2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2018年
基金
美国国家科学基金会;
关键词
Dietary Assessment; Food Portion Estimation; Generative Models; Generative Adversarial Networks; Image-to-Energy Mapping; DIETARY ASSESSMENT; SYSTEM; MODEL;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Due to the growing concern of chronic diseases and other health problems related to diet, there is a need to develop accurate methods to estimate an individual's food and energy intake. Measuring accurate dietary intake is an open research problem. In particular, accurate food portion estimation is challenging since the process of food preparation and consumption impose large variations on food shapes and appearances. In this paper, we present a food portion estimation method to estimate food energy (kilocalories) from food images using Generative Adversarial Networks (GAN). We introduce the concept of an "energy distribution" for each food image. To train the GAN, we design a food image dataset based on ground truth food labels and segmentation masks for each food image as well as energy information associated with the food image. Our goal is to learn the mapping of the food image to the food energy. We can then estimate food energy based on the energy distribution. We show that an average energy estimation error rate of 10.89% can be obtained by learning the image-to-energy mapping.
引用
收藏
页码:251 / 255
页数:5
相关论文
共 44 条
[1]   Multi-Task CNN Model for Attribute Prediction [J].
Abdulnabi, Abrar H. ;
Wang, Gang ;
Lu, Jiwen ;
Jia, Kui .
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (11) :1949-1959
[2]   Food Balance Estimation by Using Personal Dietary Tendencies in a Multimedia Food Log [J].
Aizawa, Kiyoharu ;
Maruyama, Yuto ;
Li, He ;
Morikawa, Chamin .
IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (08) :2176-2185
[3]  
[Anonymous], P SIGGRAPH AS 2012 T
[4]  
[Anonymous], 2009, P ACM MULT WORKSH MU
[5]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[6]  
Bossard L, 2014, LECT NOTES COMPUT SC, V8694, P446, DOI 10.1007/978-3-319-10599-4_29
[7]  
Chen HC, 2012, NORTHEAST BIOENGIN C, P95, DOI 10.1109/NEBC.2012.6206979
[8]   PFID: PITTSBURGH FAST-FOOD IMAGE DATASET [J].
Chen, Mei ;
Dhingra, Kapil ;
Wu, Wen ;
Yang, Lei ;
Sukthankar, Rahul ;
Yang, Jie .
2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, :289-+
[9]   Novel Technologies for Assessing Dietary Intake: Evaluating the Usability of a Mobile Telephone Food Record Among Adults and Adolescents [J].
Daugherty, Bethany L. ;
Schap, TusaRebecca E. ;
Ettienne-Gittens, Reynolette ;
Zhu, Fengqing M. ;
Bosch, Marc ;
Delp, Edward J. ;
Ebert, David S. ;
Kerr, Deborah A. ;
Boushey, Carol J. .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2012, 14 (02) :156-167
[10]  
Dehais J., 2013, 13 INT C BIOINF BIOE, P1, DOI [DOI 10.1109/BIBE.2013.6701615, 10.1109/BIBE.2013.6701615]