Food Image Segmentation Using Multi-Modal Imaging Sensors with Color and Thermal Data

被引:4
作者
Raju, Viprav B. B. [1 ]
Imtiaz, Masudul H. H. [2 ]
Sazonov, Edward [1 ]
机构
[1] Univ Alabama, Dept Elect & Comp Engn, Tuscaloosa, AL 35487 USA
[2] Clarkson Univ, Dept Elect & Comp Engn, Potsdam, NY 13699 USA
基金
比尔及梅琳达.盖茨基金会; 美国国家卫生研究院;
关键词
dietary assessment; thermal imaging; image segmentation; food image segmentation; sensors; multi-modal sensors; REGISTRATION;
D O I
10.3390/s23020560
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Sensor-based food intake monitoring has become one of the fastest-growing fields in dietary assessment. Researchers are exploring imaging-sensor-based food detection, food recognition, and food portion size estimation. A major problem that is still being tackled in this field is the segmentation of regions of food when multiple food items are present, mainly when similar-looking foods (similar in color and/or texture) are present. Food image segmentation is a relatively under-explored area compared with other fields. This paper proposes a novel approach to food imaging consisting of two imaging sensors: color (Red-Green-Blue) and thermal. Furthermore, we propose a multi-modal four-Dimensional (RGB-T) image segmentation using a k-means clustering algorithm to segment regions of similar-looking food items in multiple combinations of hot, cold, and warm (at room temperature) foods. Six food combinations of two food items each were used to capture RGB and thermal image data. RGB and thermal data were superimposed to form a combined RGB-T image and three sets of data (RGB, thermal, and RGB-T) were tested. A bootstrapped optimization of within-cluster sum of squares (WSS) was employed to determine the optimal number of clusters for each case. The combined RGB-T data achieved better results compared with RGB and thermal data, used individually. The mean +/- standard deviation (std. dev.) of the F1 score for RGB-T data was 0.87 +/- 0.1 compared with 0.66 +/- 0.13 and 0.64 +/- 0.39, for RGB and Thermal data, respectively.
引用
收藏
页数:15
相关论文
共 53 条
[41]   Integration K-Means Clustering Method and Elbow Method For Identification of The Best Customer Profile Cluster [J].
Syakur, M. A. ;
Khotimah, B. K. ;
Rochman, E. M. S. ;
Satoto, B. D. .
2ND INTERNATIONAL CONFERENCE ON VOCATIONAL EDUCATION AND ELECTRICAL ENGINEERING (ICVEE), 2018, 336
[42]   A pyramid approach to subpixel registration based on intensity [J].
Thevenaz, P ;
Ruttimann, UE ;
Unser, M .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1998, 7 (01) :27-41
[43]  
VANDENELSEN PA, 1994, P SOC PHOTO-OPT INS, V2359, P227
[44]   A survey of medical image registration - under review [J].
Viergever, Max A. ;
Maintz, J. B. Antoine ;
Klein, Stefan ;
Murphy, Keelin ;
Staring, Marius ;
Pluim, Josien P. W. .
MEDICAL IMAGE ANALYSIS, 2016, 33 :140-144
[45]   Image registration by control points pairing using the invariant properties of line segments [J].
Wang, WH ;
Chen, YC .
PATTERN RECOGNITION LETTERS, 1997, 18 (03) :269-281
[46]   Distinguishing Nigerian Food Items and Calorie Content with Hyperspectral Imaging [J].
Wang, Xinzuo ;
Rohani, Neda ;
Manerikar, Adwaiy ;
Katsagellos, Aggelos ;
Cossairt, Oliver ;
Alshurafa, Nabil .
NEW TRENDS IN IMAGE ANALYSIS AND PROCESSING - ICIAP 2017, 2017, 10590 :462-470
[47]  
Wang Y, 2017, IEEE IMAGE PROC, P1277, DOI 10.1109/ICIP.2017.8296487
[48]  
Wang Y, 2016, IEEE IMAGE PROC, P2544, DOI 10.1109/ICIP.2016.7532818
[49]   Point Cloud Hand-Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover [J].
Zhang, Yan ;
Mueller, Steffen ;
Stephan, Benedict ;
Gross, Horst-Michael ;
Notni, Gunther .
SENSORS, 2021, 21 (16)
[50]   Deviation-Sparse Fuzzy C-Means With Neighbor Information Constraint [J].
Zhang, Yuxuan ;
Bai, Xiangzhi ;
Fan, Ruirui ;
Wang, Zihan .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (01) :185-199