Weight Estimation from an RGB-D camera in top-view configuration

Cited by: 2

Authors:

Mameli, Marco [1]

Paolanti, Marina [1]

Conci, Nicola [2]

Tessaro, Filippo [2]

Frontoni, Emanuele [1]

Zingaretti, Primo [1]

Affiliations:

[1] Univ Politecn Marche, Dipartimento Ingn Informaz DII, Via Brecce Bianche 12, I-60131 Ancona, Italy

[2] Univ Trento, Dipartimento Ingn & Sci Informaz, Via Calepina 14, I-38122 Trento, Italy

Source:

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021

Keywords:

Weight Estimation; Deep Neural Networks; RGB-D camera; Top-View Configuration; REAL-TIME; TRACKING; SENSOR; MULTIPLE; HUMANS; PEOPLE; ROBUST;

DOI:

10.1109/ICPR48806.2021.9412519

Chinese Library Classification number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The development of so-called soft biometrics aims at providing information about the physical and behavioural characteristics of a person. This paper focuses on body weight estimation from observations by a top-view RGB-D camera. The ability to estimate a person's weight is useful in many applications, from health-related scenarios to business intelligence and retail analytics. To address this problem, a TVWE (Top-View Weight Estimation) framework is proposed for predicting weight. The approach relies on Deep Neural Networks (DNNs) trained on depth data. Each network has been modified in its top section to replace classification with prediction of a continuous weight value. The performance of five state-of-the-art DNNs has been compared, namely VGG16, ResNet, Inception, DenseNet and Efficient-Net. In addition, a convolutional autoencoder has been included for completeness. Given the limited literature in this domain, the TVWE framework has been evaluated on a new publicly available dataset, the "VRAI Weight estimation Dataset", which also provides, for each subject, labels for weight, gender, and height. The experimental results show that the proposed methods are suitable for this task and offer insights for applying the solution in different domains.
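The abstract states that each network's top section is modified to output a weight rather than a class. The following is a minimal, dependency-free sketch of that general idea, not the authors' implementation: the pooling step, the layer shapes, the toy feature values, and the mean absolute error metric are all assumptions introduced here for illustration, since the abstract does not specify them.

```python
import math


def global_average_pool(feature_maps):
    """Collapse each 2D feature map (a list of rows) to its mean value."""
    return [
        sum(sum(row) for row in fmap) / (len(fmap) * len(fmap[0]))
        for fmap in feature_maps
    ]


def classification_head(features, weights, biases):
    """Classification-style top: one logit per class, followed by softmax."""
    logits = [
        sum(w * f for w, f in zip(class_weights, features)) + b
        for class_weights, b in zip(weights, biases)
    ]
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def regression_head(features, weights, bias):
    """Replacement top: a single linear unit with no softmax (output in kg)."""
    return sum(w * f for w, f in zip(weights, features)) + bias


def mean_absolute_error(predicted, actual):
    """Average absolute difference between predicted and true weights."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)


# Hypothetical backbone output: two 2x2 feature maps from a depth frame.
feature_maps = [[[1.0, 3.0], [5.0, 7.0]], [[2.0, 2.0], [2.0, 2.0]]]
features = global_average_pool(feature_maps)

# Classification top (three hypothetical classes) versus regression top.
class_probs = classification_head(
    features,
    weights=[[0.1, 0.2], [0.3, -0.1], [0.0, 0.5]],
    biases=[0.0, 0.0, 0.0],
)
weight_kg = regression_head(features, weights=[10.0, 5.0], bias=20.0)

# Compare two predictions against two hypothetical ground-truth weights.
mae = mean_absolute_error([weight_kg, 80.0], [72.0, 77.0])
```

The only structural difference between the two heads is the output: the classification head returns a probability per class, while the regression head returns one unbounded scalar that can be compared directly with the labelled weight.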


Pages: 7715-7722

Page count: 8

