Unsupervised online learning for fine-grained hand segmentation in egocentric video

被引:2
作者
Zhao, Ying [1 ,2 ]
Luo, Zhiwei [2 ]
Quan, Changqin [2 ]
机构
[1] Ricoh Software Res Ctr Beijing Co Ltd, Beijing, Peoples R China
[2] Kobe Univ, Dept Comp Sci, Kobe, Hyogo, Japan
来源
2017 14TH CONFERENCE ON COMPUTER AND ROBOT VISION (CRV 2017) | 2017年
关键词
hand detection; hand segmentation; egocentric; unsupervised online learning;
D O I
10.1109/CRV.2017.17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hand segmentation is one of the most fundamental and crucial steps for egocentric human-computer interaction. The special egocentric view brings new challenges to hand segmentation task, such as the unpredictable environmental conditions. The performance of traditional hand segmentation methods depend on abundant manually labeled training data. However, these approaches do not appropriately capture the whole properties of egocentric human-computer interaction for neglecting the user-specific context. It is only necessary to build a personalized hand model of the active user. Based on this observation, we propose an online-learning hand segmentation approach without using manually labeled data for training. Our approach consists of top-down classifications and bottom-up optimizations. More specifically, we divide the segmentation task into three parts, a frame-level hand detection which detects the presence of the interactive hand using motion saliency and initializes hand masks for online learning, a superpixel-level hand classification which coarsely segments hand regions from which stable samples are selected for next level, and a pixel-level hand classification which produces a fine-grained hand segmentation. Based on the pixel-level classification result, we update the hand appearance model and optimize the upper layer classifier and detector. This online-learning strategy makes our approach robust to varying illumination conditions and hand appearances. Experimental results demonstrate the robustness of our approach.
引用
收藏
页码:248 / 255
页数:8
相关论文
共 26 条
  • [1] SLIC Superpixels Compared to State-of-the-Art Superpixel Methods
    Achanta, Radhakrishna
    Shaji, Appu
    Smith, Kevin
    Lucchi, Aurelien
    Fua, Pascal
    Suesstrunk, Sabine
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) : 2274 - 2281
  • [2] [Anonymous], P AS C COMP VIS ACCV
  • [3] Lending A Hand: Detecting Hands and Recognizing Activities in Complex Egocentric Interactions
    Bambach, Sven
    Lee, Stefan
    Crandall, David J.
    Yu, Chen
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1949 - 1957
  • [4] Gesture Recognition Using Wearable Vision Sensors to Enhance Visitors' Museum Experiences
    Baraldi, Lorenzo
    Paci, Francesco
    Serra, Giuseppe
    Benini, Luca
    Cucchiara, Rita
    [J]. IEEE SENSORS JOURNAL, 2015, 15 (05) : 2705 - 2714
  • [5] Left/right hand segmentation in egocentric videos
    Betancourt, Alejandro
    Morerio, Pietro
    Barakova, Emilia
    Marcenaro, Lucio
    Rauterberg, Matthias
    Regazzoni, Carlo
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 154 : 73 - 81
  • [6] TOWARDS A UNIFIED FRAMEWORK FOR HAND-BASED METHODS IN FIRST PERSON VISION.
    Betancourt, Alejandro
    Morerio, Pietro
    Marcenaro, Lucio
    Barakova, Emilia
    Rauterberg, Matthias
    Regazzoni, Carlo
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2015,
  • [7] The Evolution of First Person Vision Methods: A Survey
    Betancourt, Alejandro
    Morerio, Pietro
    Regazzoni, Carlo S.
    Rauterberg, Matthias
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (05) : 744 - 760
  • [8] A Sequential Classifier for Hand Detection in the Framework of Egocentric Vision
    Betancourt, Alejandro
    Lopez, Miriam M.
    Regazzoni, Carlo S.
    Rauterberg, Matthias
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, : 600 - +
  • [9] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [10] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893