Privacy-Safe Action Recognition via Cross-Modality Distillation
被引:0
|
作者:
Kim, Yuhyun
论文数: 0引用数: 0
h-index: 0
机构:
Hanbat Natl Univ, Dept Informat & Commun Engn, Daejeon 34158, South KoreaHanbat Natl Univ, Dept Informat & Commun Engn, Daejeon 34158, South Korea
Kim, Yuhyun
[1
]
Jung, Jinwook
论文数: 0引用数: 0
h-index: 0
机构:
Hanbat Natl Univ, Dept Informat & Commun Engn, Daejeon 34158, South KoreaHanbat Natl Univ, Dept Informat & Commun Engn, Daejeon 34158, South Korea
Jung, Jinwook
[1
]
Noh, Hyeoncheol
论文数: 0引用数: 0
h-index: 0
机构:
Hanbat Natl Univ, Dept Informat & Commun Engn, Daejeon 34158, South KoreaHanbat Natl Univ, Dept Informat & Commun Engn, Daejeon 34158, South Korea
Noh, Hyeoncheol
[1
]
Ahn, Byungtae
论文数: 0引用数: 0
h-index: 0
机构:
Korea Inst Machinery & Mat, Daejeon 34103, South KoreaHanbat Natl Univ, Dept Informat & Commun Engn, Daejeon 34158, South Korea
Ahn, Byungtae
[2
]
Kwon, Junghye
论文数: 0引用数: 0
h-index: 0
机构:
Chungnam Natl Univ, Coll Med, Dept Internal Med, Div Hematol Oncol, Daejeon 34134, South KoreaHanbat Natl Univ, Dept Informat & Commun Engn, Daejeon 34158, South Korea
Kwon, Junghye
[3
]
论文数: 引用数:
h-index:
机构:
Choi, Dong-Geol
[1
]
机构:
[1] Hanbat Natl Univ, Dept Informat & Commun Engn, Daejeon 34158, South Korea
[2] Korea Inst Machinery & Mat, Daejeon 34103, South Korea
[3] Chungnam Natl Univ, Coll Med, Dept Internal Med, Div Hematol Oncol, Daejeon 34134, South Korea
来源:
IEEE ACCESS
|
2024年
/
12卷
关键词:
Action recognition;
knowledge distillation;
cross-modality distillation;
deep learning;
multi modal;
privacy-safe;
D O I:
10.1109/ACCESS.2024.3431227
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Human action recognition systems enhance public safety by detecting abnormal behavior autonomously. RGB sensors commonly used in such systems capture personal information of subjects and, as a result, run the risk of potential privacy leakage. On the other hand, privacy-safe alternatives, such as depth or thermal sensors, exhibit poorer performance because they lack the semantic context provided by RGB sensors. Moreover, the data availability of privacy-safe alternatives is significantly lower than RGB sensors. To address these problems, we explore effective cross-modality distillation methods in this paper, aiming to distill the knowledge of context-rich large-scale pre-trained RGB-based models into privacy-safe depth-based models. Based on extensive experiments on multiple architectures and benchmark datasets, we propose an effective method for training privacy-safe depth-based action recognition models via cross-modality distillation: cross-modality mixing distillation. This approach improves both the performance and efficiency by enabling interaction between depth and RGB modalities through a linear combination of their features. By utilizing the proposed cross-modal mixing distillation approach, we achieve state-of-the-art accuracy in two depth-based action recognition benchmarks. The code and the pre-trained models will be available upon publication.