Bi-Modal Multiperspective Percussive (BiMP) Dataset for Visual and Audio Human Fall Detection

被引:0
|
作者
Dibble, Joe [1 ]
Bazzocchi, Michael C. F. [1 ,2 ]
机构
[1] Clarkson Univ, Dept Mech & Aerosp Engn, Astronaut & Robot Lab ASTRO Lab, Potsdam, NY 13699 USA
[2] York Univ, Dept Earth & Space Sci & Engn, Astronaut & Robot Lab ASTRO Lab, Toronto, ON M3J 1P3, Canada
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Visualization; Fall detection; Cameras; Gyroscopes; Accelerometers; Older adults; Feature extraction; Long short term memory; Support vector machines; Smart homes; Assisted living; audio analysis; fall detection; machine learning; multimodal dataset; DETECTION SYSTEM; CARE;
D O I
10.1109/ACCESS.2025.3531324
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Falls are the leading cause of injuries and fatalities among older individuals propagating concerns for health and safety. Mitigating these concerns requires timely intervention and response to minimize health complications. Autonomous fall detection systems are often offered as a means to alleviate these concerns. Typical fall detection systems classify a fall event using either inertial- or vision-based data. Aside from these two modes of input for fall detection, sparse human fall audio data is available. Audio datasets associated with the dynamic impact of human falls from the perspective of a spatial environment are both unavailable publicly and focused on niche applications. This dataset provides multimodal composition of visual and audio data to aid in fall detection technique development. Though this dataset emphasizes audio novelty, a synchronized visual component is included offering a multifaceted collection of data. Audio input for fall detection systems presents an opportunity for new approaches towards autonomous fall detection. This dataset comprises human movement representing activities of daily living and falls of 25 participants in various residential settings, yielding 1,300 instances of unique visual and audio samples. Promising utility of the bi-modal multiperspective percussive (BiMP) dataset is demonstrated through experimental data evaluations using techniques including: GoogLeNet, Long Short Term Memory, Continuous Wavelet Transforms, and Short-time Fourier Transforms for human fall detection achieving accuracies up to 96%. The concept of audio-based fall detection has the potential to mitigate concerns regarding privacy and invasiveness, in addition to broadening the scope of fall detection mechanisms.
引用
收藏
页码:26782 / 26797
页数:16
相关论文
共 13 条
  • [1] Multi Visual Modality Fall Detection Dataset
    Denkovski, Stefan
    Khan, Shehroz S.
    Malamis, Brandon
    Moon, Sae Young
    Ye, Bing
    Mihailidis, Alex
    IEEE ACCESS, 2022, 10 : 106422 - 106435
  • [2] Visual Surveillance for Human Fall Detection in Healthcare IoT
    Zhang, Yinlong
    Zheng, Xiaoyan
    Liang, Wei
    Zhang, Sichao
    Yuan, Xudong
    IEEE MULTIMEDIA, 2022, 29 (01) : 36 - 46
  • [3] Multi-Modal Anomaly Detection by Using Audio and Visual Cues
    Rehman, Ata-Ur
    Ullah, Hafiz Sami
    Farooq, Haroon
    Khan, Muhammad Salman
    Mahmood, Tayyeb
    Khan, Hafiz Owais Ahmed
    IEEE ACCESS, 2021, 9 : 30587 - 30603
  • [4] Bi-modal annoyance level detection from speech and text
    Justo, Raquel
    Irastorza, Jon
    Perez, Saioa
    Ines Torres, M.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2018, (61): : 83 - 89
  • [5] Bi-Modal Transformer-Based Approach for Visual Question Answering in Remote Sensing Imagery
    Bazi, Yakoub
    Al Rahhal, Mohamad Mahmoud
    Mekhalfi, Mohamed Lamine
    Al Zuair, Mansour Abdulaziz
    Melgani, Farid
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [6] ABiFN: Attention-based bi-modal fusion network for object detection at night time
    Sai Charan, A.
    Jitesh, M.
    Chowdhury, M.
    Venkataraman, H.
    ELECTRONICS LETTERS, 2020, 56 (24) : 1309 - 1311
  • [7] Visual Based Fall Detection Through Human Shape Variation and Head Detection
    Chua, Jia-Luen
    Chang, Yoong Choon
    Lim, Wee Keong
    2013 INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES (IMPACT), 2013, : 61 - 65
  • [8] Advancing Human Fall Detection with TsetFall: A Comprehensive, Fine-Grained, and Publicly Available Dataset
    Dutra, Eduardo Facanha
    Carvalho de Oliveira, Thiago Rodrigo
    Formico Rodrigues, Maria Andreia
    2023 IEEE 11TH INTERNATIONAL CONFERENCE ON SERIOUS GAMES AND APPLICATIONS FOR HEALTH, SEGAH, 2023,
  • [9] Producing Synthetic Dataset for Human Fall Detection in AR/VR Environments
    Zherdev, Denis
    Zherdeva, Larisa
    Agapov, Sergey
    Sapozhnikov, Anton
    Nikonorov, Artem
    Chaplygin, Sergej
    APPLIED SCIENCES-BASEL, 2021, 11 (24):
  • [10] Integration of machine learning bi-modal engagement emotion detection model to self-reporting for educational satisfaction measurement
    Alhasson H.F.
    Alsaheel G.M.
    Alsalamah A.A.
    Alharbi N.S.
    Alhujilan J.M.
    Alharbi S.S.
    International Journal of Information Technology, 2024, 16 (6) : 3633 - 3647