Real-Time Human-Computer Interaction Using Eye Gazes

被引:8
作者
Chen, Haodong [1 ]
Zendehdel, Niloofar [1 ]
Leu, Ming C. [1 ]
Yin, Zhaozheng [2 ,3 ]
机构
[1] Missouri Univ Sci & Technol, Dept Mech & Aerosp Engn, Rolla, MO 65409 USA
[2] SUNY Stony Brook, Dept Biomed Informat, Stony Brook, NY 11794 USA
[3] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY 11794 USA
基金
美国国家科学基金会;
关键词
Eye gaze recognition; Human-computer Interaction; Instance Segmentation; Mask R-CNN;
D O I
10.1016/j.mfglet.2023.07.024
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Eye gaze emerges as a unique channel in human-computer interaction (HCI) that recognizes human intention based on gaze behavior and enables contactless access to control and operate software interfaces on computers. In this paper, we propose a real-time HCI system using eye gaze. First, we capture and track eyes using the Dlib 68-point landmark detector, and design an eye gaze recognition model to recognize four types of eye gazes. Then, we construct an instance segmentation model to recognize and segment tools and parts using the Mask Region-Based Convolutional Neural Network (R-CNN) method. After that, we design an HCI software interface by integrating and visualizing the proposed eye gaze recognition and instance segmentation models. The HCI system captures, tracks, and recognizes the eye gaze through a red-green-blue (RGB) webcam, and provides responses based on the detected eye gaze, including the tool and part segmentation, object selection and interface switching. Experimental results show that the proposed eye gaze recognition method achieves an accuracy of > 99% in a recommended distance between the eyes and the webcam, and the instance segmentation model achieves an accuracy of 99%. The experimental results of the HCI system operation demonstrate the feasibility and robustness of the proposed real-time HCI system. (c) 2023 The Authors. Published by ELSEVIER Ltd. This is an open access article under the CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0)
引用
收藏
页码:883 / 894
页数:12
相关论文
共 28 条
[1]  
Bali J., 2014, J Clin Ophthalmol Res, V2, P61, DOI [10.4103/2320-3897.122661, DOI 10.4103/2320-3897.122661, 10.4103/23203897.122661, DOI 10.4103/23203897.122661]
[2]  
Beymer D, 2003, PROC CVPR IEEE, P451
[3]   Design of a robotic rehabilitation system for mild cognitive impairment based on computer vision [J].
Chen, Hao-Dong ;
Zhu, Hongbo ;
Teng, Zhiqiang ;
Zhao, Ping .
Journal of Engineering and Science in Medical Diagnostics and Therapy, 2020, 3 (02)
[4]  
Chen HD, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS, ROBOTICS AND AUTOMATION (ICMRA), P105, DOI 10.1109/ICMRA.2018.8490562
[5]   Real-Time Multi-Modal Human-Robot Collaboration Using Gestures and Speech [J].
Chen, Haodong ;
Leu, Ming C. ;
Yin, Zhaozheng .
JOURNAL OF MANUFACTURING SCIENCE AND ENGINEERING-TRANSACTIONS OF THE ASME, 2022, 144 (10)
[6]  
Chen HD, 2020, PROCEEDINGS OF THE ASME 2020 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2020, VOL 2B
[7]   An Integrated Target Acquisition Approach and Graphical User Interface Tool for Parallel Manipulator Assembly [J].
Chen, Haodong ;
Teng, Zhiqiang ;
Guo, Zheng ;
Zhao, Ping .
JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2020, 20 (02)
[8]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[9]   A review of semantic segmentation using deep neural networks [J].
Guo, Yanming ;
Liu, Yu ;
Georgiou, Theodoros ;
Lew, Michael S. .
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2018, 7 (02) :87-93
[10]   A survey on instance segmentation: state of the art [J].
Hafiz, Abdul Mueed ;
Bhat, Ghulam Mohiuddin .
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2020, 9 (03) :171-189