A Real-Life Evaluation of Supervised and Semi-Supervised Machine Learning Approaches for Indirect Estimation of Indoor Occupancy

被引:0
作者
Mena-Martinez, Alma [1 ]
Davila Delgado, Manuel [2 ]
Alvarado-Uribe, Joanna [1 ,3 ]
Ceballos, Hector G. [1 ]
机构
[1] Tecnol Monterrey, Sch Engn & Sci, Monterrey 64700, Nuevo Leon, Mexico
[2] Birmingham City Univ BCU, Appl Artificial Intelligence AI & Business Transfo, Birmingham B4 7BD, Warwick, England
[3] Tecnol Monterrey, Inst Future Educ, Monterrey 64700, Nuevo Leon, Mexico
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Support vector machines; Temperature sensors; Estimation; Mathematical models; Hidden Markov models; Temperature measurement; Semisupervised learning; Environmental monitoring; Sensors; Occupancy estimation; semi-supervised learning; environmental sensors; machine learning; PREDICTION;
D O I
10.1109/ACCESS.2024.3449810
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Occupancy information is essential for space management, energy efficiency, and in times of the COVID-19 pandemic, for crowd control. Obtaining labeled data is challenging due to hardware limitations, privacy considerations, and the required underlying costs. This study demonstrates the benefits of using Semi-Supervised Learning (SSL) for occupancy estimation in enclosed spaces; which requires less labeled data than other Machine Learning (ML) methods. It presents an empirical comparison between Supervised ML and SSL models in three real-life university classrooms (uncontrolled conditions). The data was collected for three weeks at each classroom using an in-house developed Internet of Things (IoT) device that measures air temperature, relative humidity, and atmospheric pressure. The ground truth records were gathered through manual logging of occupancy levels. Datasets' sizes averaged 2350 entries with only 280 labeled instances per dataset. Support Vector Machine (SVM), Random Forest (RF), and Multi-Layer Perceptron (MLP) were used to define a performance baseline for supervised ML. Self-Training (ST) and Label Propagation (LP) were tested for SSL. ST achieved superior performance compared to baseline models (SVM, RF, MLP) with a highest average accuracy of 87.33% compared to SVM (86.66%). These results demonstrate the effectiveness of SSL for indirect occupancy estimation while reducing the need for extensive data collection and labeling.
引用
收藏
页码:118673 / 118693
页数:21
相关论文
共 50 条
  • [1] A Non-Intrusive Approach for Indoor Occupancy Detection in Smart Environments
    Abade, Bruno
    Abreu, David Perez
    Curado, Marilia
    [J]. SENSORS, 2018, 18 (11)
  • [2] Acuña E, 2004, ST CLASS DAT ANAL, P639
  • [3] Indoor Occupancy Detection and Estimation using Machine Learning and Measurements from an IoT LoRa-based Monitoring System
    Adeogun, Ramoni
    Rodriguez, Ignacio
    Razzaghpour, Mohammad
    Berardinelli, Gilberto
    Christensen, Per Hartmann
    Mogensen, Preben Elgaard
    [J]. 2019 GLOBAL IOT SUMMIT (GIOTS), 2019,
  • [4] Predicting Human Count through Environmental Sensing in Closed Indoor Settings
    Ahmed, Shamir
    Kamal, Uday
    Toha, Tarik Reza
    Islam, Nafisa
    Al Islam, A. B. M. Alim
    [J]. PROCEEDINGS OF THE 15TH EAI INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES (MOBIQUITOUS 2018), 2018, : 49 - 58
  • [5] Akkaya B., 2019, P INT C REC DEV DAT, P1
  • [6] Non-Intrusive Room Occupancy Prediction Performance Analysis Using Different Machine Learning Techniques
    Aliero, Muhammad S.
    Pasha, Muhammad F.
    Smith, David T.
    Ghani, Imran
    Asif, Muhammad
    Jeong, Seung Ryul
    Samuel, Moveh
    [J]. ENERGIES, 2022, 15 (23)
  • [7] [Anonymous], 2023, Aws IoT Core: Developer Guide
  • [8] aws.amazon, 2022, Tutorial: Storing Device Data a DynamoDB Table
  • [9] Occupancy modeling on non-intrusive indoor environmental data through machine learning
    Banihashemi, Farzan
    Weber, Manuel
    Deghim, Fatma
    Zong, Chujun
    Lang, Werner
    [J]. BUILDING AND ENVIRONMENT, 2024, 254
  • [10] Baratloo A, 2015, EMERGENCY, V3, P48