JS']JSE: Joint Semantic Encoder for zero-shot gesture learning

被引:1
作者
Madapana, Naveen [1 ]
Wachs, Juan [1 ]
机构
[1] Purdue Univ, Sch Ind Engn, W Lafayette, IN 47906 USA
基金
美国医疗保健研究与质量局;
关键词
Zero-shot learning; Gesture recognition; Feature selection; Transfer learning; ACTION RECOGNITION; VISUALIZATION; INTERFACE;
D O I
10.1007/s10044-021-00992-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot learning (ZSL) is a transfer learning paradigm that aims to recognize unseen categories just by having a high-level description of them. While deep learning has greatly pushed the limits of ZSL for object classification, ZSL for gesture recognition (ZSGL) remains largely unexplored. Previous attempts to address ZSGL were focused on the creation of gesture attributes and algorithmic improvements, and there is little or no research concerned with feature selection for ZSGL. It is indisputable that deep learning has obviated the need for feature engineering for problems with large datasets. However, when the data are scarce, it is critical to leverage the domain information to create discriminative input features. The main goal of this work is to study the effect of three different feature extraction techniques (velocity, heuristical and latent features) on the performance of ZSGL. In addition, we propose a bilinear auto-encoder approach, referred to as Joint Semantic Encoder (JSE), for ZSGL that jointly minimizes the reconstruction, semantic and classification losses. We conducted extensive experiments to compare and contrast the feature extraction techniques and to evaluate the performance of JSE with respect to existing ZSL methods. For attribute-based classification scenario, irrespective of the feature type, results showed that JSE outperforms other approaches by 5% (p<0.01). When JSE is trained with heuristical features in across-category condition, we showed that JSE significantly outperforms other methods by 5% (p<0.01)).
引用
收藏
页码:679 / 692
页数:14
相关论文
共 50 条
  • [21] ENCYCLOPEDIA ENHANCED SEMANTIC EMBEDDING FOR ZERO-SHOT LEARNING
    Jia, Zhen
    Zhang, Junge
    Huang, Kaiqi
    Tan, Tieniu
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1287 - 1291
  • [22] Learning exclusive discriminative semantic information for zero-shot learning
    Jian-Xun Mi
    Zhonghao Zhang
    Debao Tai
    Li-Fang Zhou
    Wei Jia
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 761 - 772
  • [23] A study on zero-shot learning from semantic viewpoint
    Bhagat, P. K.
    Choudhary, Prakash
    Singh, Kh Manglem
    VISUAL COMPUTER, 2023, 39 (05) : 2149 - 2163
  • [24] A meaningful learning method for zero-shot semantic segmentation
    Xianglong Liu
    Shihao Bai
    Shan An
    Shuo Wang
    Wei Liu
    Xiaowei Zhao
    Yuqing Ma
    Science China Information Sciences, 2023, 66
  • [25] A Semantic Similarity Supervised Autoencoder for Zero-Shot Learning
    Shen, Fengli
    Lu, Zhe-Ming
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (06): : 1419 - 1422
  • [26] Learning complementary semantic information for zero-shot recognition
    Hu, Xiaoming
    Wang, Zilei
    Li, Junjie
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 115
  • [27] Spherical Zero-Shot Learning
    Shen, Jiayi
    Xiao, Zehao
    Zhen, Xiantong
    Zhang, Lei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) : 634 - 645
  • [28] Visual Semantic Segmentation Based on Few/Zero-Shot Learning: An Overview
    Ren, Wenqi
    Tang, Yang
    Sun, Qiyu
    Zhao, Chaoqiang
    Han, Qing-Long
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2024, 11 (05) : 1106 - 1126
  • [29] Zero-Shot Learning with Joint Generative Adversarial Networks
    Zhang, Minwan
    Wang, Xiaohua
    Shi, Yueting
    Ren, Shiwei
    Wang, Weijiang
    ELECTRONICS, 2023, 12 (10)
  • [30] JOINT ON-LINE LEARNING OF A ZERO-SHOT SPOKEN SEMANTIC PARSER AND A REINFORCEMENT LEARNING DIALOGUE MANAGER
    Riou, Matthieu
    Jabaian, Bassam
    Huet, Stephane
    Lefevre, Fabrice
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3072 - 3076