Focal Channel Knowledge Distillation for Multi-Modality Action Recognition

Cited by: 1

Authors:

Gan, Lipeng [1]

Cao, Runze [1]

Li, Ning [1]

Yang, Man [1]

Li, Xiaochao [1,2,3]

Affiliations:

[1] Xiamen Univ, Dept Microelect & Integrated Circuit, Xiamen 361005, Peoples R China

[2] Xiamen Univ Malaysia, Dept Elect & Elect Engn, Sepang 43900, Selangor, Malaysia

[3] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2006, Australia

Source:

IEEE ACCESS | 2023 / Vol. 11

Keywords:

Action recognition; knowledge distillation; multi-modality

DOI:

10.1109/ACCESS.2023.3298647

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Multi-modality action recognition aims to learn complementary information from multiple modalities to improve action recognition performance. However, the channels of different modalities differ significantly, and transferring channel semantic features equally from all modalities to the RGB network causes competition and redundancy during knowledge distillation. To address this issue, we propose a focal channel knowledge distillation strategy that transfers the key semantic correlations and distributions of multi-modality teachers into the RGB student network. The focal channel correlations provide the intrinsic relationships and diversity properties of key semantics, and the focal channel distributions provide the salient channel activations of features. By ignoring less-discriminative and irrelevant channels, the student can use its channel capacity more efficiently to learn complementary semantic features from the other modalities. Our focal channel knowledge distillation achieves 91.2%, 95.6%, 98.3%, and 81.0% accuracy on the NTU 60 (CS), UTD-MHAD, N-UCLA, and HMDB51 datasets, respectively, which is an improvement of 4.5%, 4.2%, 3.7%, and 7.1% over unimodal RGB models. The framework can also be integrated with unimodal models to achieve state-of-the-art performance. Extensive experiments show that the proposed method achieves 92.5%, 96.0%, 98.9%, and 82.3% accuracy on the NTU 60 (CS), UTD-MHAD, N-UCLA, and HMDB51 datasets, respectively.
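The abstract gives no formulas, so the following is only an illustrative sketch of the general idea, not the paper's actual formulation. It assumes that "focal" channels are chosen by mean absolute teacher activation, that channel distributions are compared with a KL divergence over spatial softmaxes, and that channel correlations are compared through cosine-similarity matrices. All function names, the top-k selection rule, and the temperature `tau` are hypothetical.

```python
import math

# Features are lists of channels; each channel is a flat list of spatial activations.

def channel_saliency(feats):
    """Mean absolute activation per channel (assumed saliency measure)."""
    return [sum(abs(v) for v in ch) / len(ch) for ch in feats]

def focal_channels(teacher, k):
    """Indices of the k most salient teacher channels, in ascending order."""
    sal = channel_saliency(teacher)
    top = sorted(range(len(sal)), key=lambda i: sal[i], reverse=True)[:k]
    return sorted(top)

def softmax(xs, tau=1.0):
    """Numerically stable softmax over one channel's spatial positions."""
    m = max(xs)
    exps = [math.exp((x - m) / tau) for x in xs]
    z = sum(exps)
    return [e / z for e in exps]

def distribution_loss(teacher, student, idx, tau=1.0):
    """Mean KL(teacher || student) over the selected channels."""
    total = 0.0
    for c in idx:
        p = softmax(teacher[c], tau)
        q = softmax(student[c], tau)
        total += sum(pi * math.log(pi / max(qi, 1e-12))
                     for pi, qi in zip(p, q) if pi > 0.0)
    return total / len(idx)

def cosine(a, b):
    """Cosine similarity between two channels."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb + 1e-12)

def correlation_matrix(feats, idx):
    """Pairwise cosine similarities among the selected channels."""
    return [[cosine(feats[i], feats[j]) for j in idx] for i in idx]

def correlation_loss(teacher, student, idx):
    """Mean squared difference between teacher and student correlation matrices."""
    ct = correlation_matrix(teacher, idx)
    cs = correlation_matrix(student, idx)
    n = len(idx)
    return sum((ct[i][j] - cs[i][j]) ** 2
               for i in range(n) for j in range(n)) / (n * n)

# Toy example: 3 channels, 4 spatial positions each (made-up values).
teacher = [[0.9, 0.1, 0.8, 0.2], [0.01, 0.02, 0.01, 0.0], [0.5, 0.7, 0.1, 0.6]]
student = [[0.2, 0.3, 0.1, 0.4], [0.9, 0.9, 0.9, 0.9], [0.6, 0.5, 0.2, 0.5]]
idx = focal_channels(teacher, 2)  # the weak middle channel is ignored
loss = distribution_loss(teacher, student, idx) + correlation_loss(teacher, student, idx)
print(idx, loss)
```

In this toy setup the student's strongly activated middle channel contributes nothing to the loss, because only the teacher's salient channels are compared. That mirrors the abstract's point about ignoring less-discriminative channels.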


Pages: 78285-78298

Page count: 14
