Eye-wearable head-mounted tracking and gaze estimation interactive machine system for human-machine interface

被引:7
作者
Lee, Ko-Feng [1 ]
Chen, Yen-Lin [1 ]
Yu, Chao-Wei [1 ]
Jen, Cheng-Lung [2 ]
Chin, Kai-Yi [3 ]
Hung, Chen-Wei [4 ]
Wen, Chih-Bo [1 ]
机构
[1] Natl Taipei Univ Technol, Dept Comp Sci & Informat Engn, Taipei 10608, Taiwan
[2] MediaTek Inc, Hsinchu, Taiwan
[3] Aletheia Univ, Dept Digital Humanities & Informat Applicat, New Taipei, Taiwan
[4] Univ Tunku Abdul Rahman, Fac Informat & Commun Technol, Perak, Malaysia
关键词
Eye tracking; gaze estimation; machine vision; head-mounted; human-machine interface; CALIBRATION;
D O I
10.1177/1461348419875047
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this study, a head-mounted camera was used to track eye behaviors and estimate the gaze point on the user's visual plane. The integration of the elastic mechanism design makes the headset adaptable for various users. The wearable cases were prototyped with low-cost cameras to produce an efficient eye tracking solution. This proposed system can effectively extract and estimate pupil ellipse from a few camera images of an eye and compute the corresponding three-dimensional eye model. The system can match later images of the same pupil ellipse from a head-mounted camera to give the possible visual angles. To estimate the gaze point, the system uses multiple-point calibration to solve the related polynomial formula for future angle-to-gaze mapping. The proposed eye-tracking algorithms can provide a low-complexity solution with high accuracy, precision, and speed. This tracking system is a low-cost and promising system that can be used in headsets for virtual reality, auxiliary equipment, interactive machine, and human-machine interface applications. The proposed eye-tracking algorithm can achieve satisfactory performance without using a high-end high-speed camera and can be detected under different lighting sources, and the average errors of the detection results are stably within 9 pixels and at a distance of 50 cm from the screen; while the average error of the fixation mapping results is within 3 degrees.
引用
收藏
页码:18 / 38
页数:21
相关论文
共 28 条
  • [1] [Anonymous], 2017, DIGITIME
  • [2] [Anonymous], 2017, TECHNEWS
  • [3] [Anonymous], 1992, SHAPE DETECTION COMP
  • [4] How people look at pictures before, during, and after scene capture: Buswell revisited
    Babcock, JS
    Lipps, M
    Pelz, JB
    [J]. HUMAN VISION AND ELECTRONIC IMAGING VII, 2002, 4662 : 34 - 47
  • [5] A Vision-Based Driver Nighttime Assistance and Surveillance System Based on Intelligent Image Sensing Techniques and a Heterogamous Dual-Core Embedded System Architecture
    Chen, Yen-Lin
    Chiang, Hsin-Han
    Chiang, Chuan-Yen
    Liu, Chuan-Ming
    Yuan, Shyan-Ming
    Wang, Jenq-Haur
    [J]. SENSORS, 2012, 12 (03) : 2373 - 2399
  • [6] Vision-Based Finger Detection, Tracking, and Event Identification Techniques for Multi-Touch Sensing and Display Systems
    Chen, Yen-Lin
    Liang, Wen-Yew
    Chiang, Chuan-Yen
    Hsieh, Tung-Ju
    Lee, Da-Cheng
    Yuan, Shyan-Ming
    Chang, Yang-Lang
    [J]. SENSORS, 2011, 11 (07): : 6868 - 6892
  • [7] Cheng-Lung Jen, 2016, 2016 IEEE International Conference on Consumer Electronics (ICCE), P202, DOI 10.1109/ICCE.2016.7430580
  • [8] Cherif ZR, 2002, IEEE IMTC P, P1029, DOI 10.1109/IMTC.2002.1007096
  • [9] Human eye localization using the modified Hough transform
    Dobes, M.
    Martinek, J.
    Skoupil, D.
    Dobesova, Z.
    Pospisil, J.
    [J]. OPTIK, 2006, 117 (10): : 468 - 473
  • [10] LEAST-SQUARES FITTING OF CIRCLES AND ELLIPSES
    GANDER, W
    GOLUB, GH
    STREBEL, R
    [J]. BIT, 1994, 34 (04): : 558 - 578