MDST: 2-D Human Pose Estimation for SISO UWB Radar Based on Micro-Doppler Signature via Cascade and Parallel Swin Transformer

被引:2
|
作者
Zhou, Xiaolong [1 ]
Jin, Tian [1 ]
Dai, Yongpeng [1 ]
Song, Yongping [1 ]
Li, Kemeng [1 ]
Song, Shaoqiu [1 ]
机构
[1] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha 410005, Peoples R China
基金
中国国家自然科学基金;
关键词
Pose estimation; Sensors; Ultra wideband radar; Radar imaging; Transformers; Radar; Heating systems; Human pose estimation; human pose estimation based on single-input single-output (SISO) ultra-wideband (UWB) radar (HPSUR) benchmark; inverted feature pyramid network (IFPN); micro-Doppler (MD); SISO UWB radar; Swin Transformer; MMWAVE RADARS; TRACKING;
D O I
10.1109/JSEN.2024.3401861
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This article introduces the human pose estimation based on the single-input single-output (SISO) ultra-wideband (UWB) radar (HPSUR) benchmark, a pioneering approach in human pose estimation integrating motion capture (MOCAP) technology based on SISO UWB radar sensors. The HPSUR dataset, consisting of 311963 data frames, was meticulously assembled using cross-calibrated SISO UWB radar sensors in conjunction with the Noitom Perception Neuron 3 (N3), specifically designed for radar-based human pose estimation. This dataset captures diverse movements from five subjects of varying physical characteristics, performing four distinct categories of actions. In addition to establishing this comprehensive benchmark, this article proposes an innovative framework for 2-D HPSUR. The framework leverages the processing of micro-Doppler (MD) signatures through a unique combination of cascade and parallel Swin Transformers. The MD signatures, reflective of human kinematics, form the basis for a novel methodology in posture identification, thus enhancing the perception of human postures. Addressing the challenge of managing long-range dependencies due to the high sampling rates of radar devices, we introduce the MD Swin Transformer (MDST) network. This novel transformer incorporates window-based multihead self-attention (W-MSA) and shifted window-based multihead self-attention (SW-MSA) models to capture the inner-frame and intraframe aspects of the MD signature adeptly. Furthermore, this study integrates an inverted feature pyramid network (IFPN) for an efficient multiscale feature representation, enriching the feature pyramid with high-level semantics. Our extensive experimental analysis, conducted on the HPSUR benchmark, demonstrates the significant enhancement in the accuracy of human pose estimation offered by the proposed MDST network. This improvement is consistently observed across six MDST variants under various conditions involving diverse subjects and postures, showcasing the robustness and generalizability of our approach.
引用
收藏
页码:21730 / 21749
页数:20
相关论文
共 1 条
  • [1] Three-Dimensional Human Pose Estimation from Micro-Doppler Signature Based on SISO UWB Radar
    Zhou, Xiaolong
    Jin, Tian
    Dai, Yongpeng
    Song, Yongping
    Li, Kemeng
    REMOTE SENSING, 2024, 16 (07)