JS']JSE: Joint Semantic Encoder for zero-shot gesture learning

被引:2
作者
Madapana, Naveen [1 ]
Wachs, Juan [1 ]
机构
[1] Purdue Univ, Sch Ind Engn, W Lafayette, IN 47906 USA
基金
美国医疗保健研究与质量局;
关键词
Zero-shot learning; Gesture recognition; Feature selection; Transfer learning; ACTION RECOGNITION; VISUALIZATION; INTERFACE;
D O I
10.1007/s10044-021-00992-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot learning (ZSL) is a transfer learning paradigm that aims to recognize unseen categories just by having a high-level description of them. While deep learning has greatly pushed the limits of ZSL for object classification, ZSL for gesture recognition (ZSGL) remains largely unexplored. Previous attempts to address ZSGL were focused on the creation of gesture attributes and algorithmic improvements, and there is little or no research concerned with feature selection for ZSGL. It is indisputable that deep learning has obviated the need for feature engineering for problems with large datasets. However, when the data are scarce, it is critical to leverage the domain information to create discriminative input features. The main goal of this work is to study the effect of three different feature extraction techniques (velocity, heuristical and latent features) on the performance of ZSGL. In addition, we propose a bilinear auto-encoder approach, referred to as Joint Semantic Encoder (JSE), for ZSGL that jointly minimizes the reconstruction, semantic and classification losses. We conducted extensive experiments to compare and contrast the feature extraction techniques and to evaluate the performance of JSE with respect to existing ZSL methods. For attribute-based classification scenario, irrespective of the feature type, results showed that JSE outperforms other approaches by 5% (p<0.01). When JSE is trained with heuristical features in across-category condition, we showed that JSE significantly outperforms other methods by 5% (p<0.01)).
引用
收藏
页码:679 / 692
页数:14
相关论文
共 50 条
[41]   Learning Unbiased Zero-Shot Semantic Segmentation Networks Via Transductive Transfer [J].
Lv, Fengmao ;
Liu, Haiyang ;
Wang, Yichen ;
Zhao, Jiayi ;
Yang, Guowu .
IEEE SIGNAL PROCESSING LETTERS, 2020, 27 :1640-1644
[42]   Research progress of zero-shot learning [J].
Sun, Xiaohong ;
Gu, Jinan ;
Sun, Hongying .
APPLIED INTELLIGENCE, 2021, 51 (06) :3600-3614
[43]   Zero-Shot Learning With Transferred Samples [J].
Guo, Yuchen ;
Ding, Guiguang ;
Han, Jungong ;
Gao, Yue .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (07) :3277-3290
[44]   Learning cross-domain semantic-visual relationships for transductive zero-shot learning [J].
Lv, Fengmao ;
Zhang, Jianyang ;
Yang, Guowu ;
Feng, Lei ;
Yu, Yufeng ;
Duan, Lixin .
PATTERN RECOGNITION, 2023, 141
[45]   SEMANTIC EMBEDDING SPACE FOR ZERO-SHOT ACTION RECOGNITION [J].
Xu, Xun ;
Hospedales, Timothy ;
Gong, Shaogang .
2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, :63-67
[46]   Variational Auto-Encoder Combined with Knowledge Graph Zero-Shot Learning [J].
Zhang, Haitao ;
Su, Lin .
Computer Engineering and Applications, 2023, 59 (01) :236-243
[47]   Multi-Modality Adversarial Auto-Encoder for Zero-Shot Learning [J].
Ji, Zhong ;
Dai, Guangwen ;
Yu, Yunlong .
IEEE ACCESS, 2020, 8 :9287-9295
[48]   JOINT PROBABILITY ESTIMATION OF ATTRIBUTE CHAIN FOR ZERO-SHOT LEARNING [J].
Qiao, Lingfeng ;
Tuo, Hongya ;
Fang, Zheng ;
Feng, Peng ;
Jing, Zhongliang .
2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, :1863-1867
[49]   SVDML: Semantic and Visual Space Deep Mutual Learning for Zero-Shot Learning [J].
Lu, Nannan ;
Luo, Yi ;
Qiu, Mingkai .
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 :383-395
[50]   Improving Semantic Embedding Consistency by Metric Learning for Zero-Shot Classiffication [J].
Bucher, Maxime ;
Herbin, Stephane ;
Jurie, Frederic .
COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 :730-746