Virtual Keyboards With Real-Time and Robust Deep Learning-Based Gesture Recognition

被引:4
作者
Lee, Tae-Ho [1 ]
Kim, Sunwoong [2 ]
Kim, Taehyun [1 ]
Kim, Jin-Sung [3 ]
Lee, Hyuk-Jae [1 ]
机构
[1] Seoul Natl Univ, Interuniv Semicond Res Ctr, Dept Elect & Comp Engn, Seoul 08226, South Korea
[2] Univ Washington, Div Engn & Math, Bothell, WA 98011 USA
[3] Sun Moon Univ, Dept Elect Engn, Asan 31460, South Korea
关键词
Layout; Keyboards; Indexes; Thumb; Mathematical models; Real-time systems; Optimization; Augmented reality (AR); deep learning (DL); gesture recognition (GR); keyboard layout optimization; virtual keyboard (VKB); virtual reality (VR); SYSTEM; LAYOUT;
D O I
10.1109/THMS.2022.3165165
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In head-mounted display devices for augmented reality and virtual reality, external signals are often entered using a virtual keyboard (VKB). Among various user interfaces for VKBs, hand gestures are widely used because they are fast and intuitive. This work proposes a gesture-recognition (GR)-based VKB algorithm that is accurate in any environment and operates in real time. Specifically, the proposed ambidextrous VKB layouts reduce the total finger travel distance on one-hand VKB layouts. Additionally, a fast typing action is proposed to use characteristics when previous and current keys are adjacent. To be robust in any environment, we utilize a deep learning (DL)-based GR method in the proposed VKB algorithm. To train DL networks, seven classes are defined and an automated dataset generation method is proposed to reduce the necessary time and effort. The proposed one-hand VKB layout with the fast typing action shows a 1.5x faster typing speed than the popular ABC keyboard layout. Furthermore, the proposed ambidextrous VKB layout brings an additional 52% improvement compared with the proposed one-hand VKB layout. The proposed DL-based GR method implemented on the well-known YOLOv3 machine learning framework shows a mean average precision rate of 95% for images including background colors similar to skin color. The proposed DL-based GR method for one-hand and ambidextrous VKBs achieves around 41 frames per second on a software platform, which allows real-time processing.
引用
收藏
页码:725 / 735
页数:11
相关论文
共 45 条
  • [1] [Anonymous], 2012, PHONE DIALABC K PADS
  • [2] [Anonymous], 1992, AUGMENT ALTERN COMM
  • [3] [Anonymous], 1987, Augmentative and alternative communication
  • [4] Aouam D., 2018, P 2018 3 INT C PATT, P1, DOI [DOI 10.1109/PAIS.2018.8598516, 10. 1109/PAIS.2018.8598516]
  • [5] Argyros AA, 2006, LECT NOTES COMPUT SC, V3979, P40
  • [6] Advantages of Eye-Gaze over Head-Gaze-Based Selection in Virtual and Augmented Reality under Varying Field of Views
    Blattgerste, Jonas
    Renner, Patrick
    Pfeiffer, Thies
    [J]. COMMUNICATION BY GAZE INTERACTION (COGAIN 2018), 2018,
  • [7] Carroll L., 1969, Alice's adventures in wonderland
  • [8] Chai Z., 2018, ARXIV181211090
  • [9] Chubon R A, 1988, J Rehabil Res Dev, V25, P17
  • [10] Dextto, 2018, 500 AM ENGL SENT MOS