The Jester Dataset: A Large-Scale Video Dataset of Human Gestures

Cited by: 122
Authors
Materzynska, Joanna [1]
Berger, Guillaume [1]
Bax, Ingo [1]
Memisevic, Roland [1]
Affiliation
[1] Twenty Billion Neurons GmbH, Berlin, Germany
Source
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019
DOI
10.1109/ICCVW.2019.00349
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Gesture recognition and its application in human-computer interfaces have been growing increasingly popular in recent years. Although many gestures can be recognized from a single image frame, to build a responsive, accurate system that can recognize complex gestures with subtle differences between them, we need large-scale real-world video datasets. In this work, we introduce the largest collection of short video clips of humans performing gestures in front of the camera. The dataset has been collected with the help of over 1300 different actors in their unconstrained environments. Additionally, we present an ongoing gesture recognition challenge based on our dataset, along with the current results. We also describe how a baseline achieving over 93% recognition accuracy can be obtained with a simple 3D convolutional neural network.
Pages: 2874-2882
Page count: 9