Deep Learning for Intelligent Human-Computer Interaction

被引:36
|
作者
Lv, Zhihan [1 ]
Poiesi, Fabio [2 ]
Dong, Qi [3 ]
Lloret, Jaime [4 ]
Song, Houbing [5 ]
机构
[1] Uppsala Univ, Fac Arts, Dept Game Design, SE-62167 Uppsala, Sweden
[2] Fdn Bruno Kessler, Digital Ind Ctr, Technol Vis, Via Sommar 18, I-38123 Trento, Italy
[3] Amazon AWS AI, Seattle, WA 98125 USA
[4] Univ Politecn Valencia, Inst Invest Gest Integrada Zonas Costeras, Valencia 46022, Spain
[5] Embry Riddle Aeronaut Univ, Secur & Optimizat Networked Globe Lab SONG Lab, Daytona Beach, FL 32114 USA
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 22期
关键词
human-computer interaction; deep learning; speech recognition; gesture recognition; emotion recognition; HUMAN ACTION RECOGNITION; SYSTEM; MODEL; LSTM;
D O I
10.3390/app122211457
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In recent years, gesture recognition and speech recognition, as important input methods in Human-Computer Interaction (HCI), have been widely used in the field of virtual reality. In particular, with the rapid development of deep learning, artificial intelligence, and other computer technologies, gesture recognition and speech recognition have achieved breakthrough research progress. The search platform used in this work is mainly the Google Academic and literature database Web of Science. According to the keywords related to HCI and deep learning, such as "intelligent HCI", "speech recognition", "gesture recognition", and "natural language processing", nearly 1000 studies were selected. Then, nearly 500 studies of research methods were selected and 100 studies were finally selected as the research content of this work after five years (2019-2022) of year screening. First, the current situation of the HCI intelligent system is analyzed, the realization of gesture interaction and voice interaction in HCI is summarized, and the advantages brought by deep learning are selected for research. Then, the core concepts of gesture interaction are introduced and the progress of gesture recognition and speech recognition interaction is analyzed. Furthermore, the representative applications of gesture recognition and speech recognition interaction are described. Finally, the current HCI in the direction of natural language processing is investigated. The results show that the combination of intelligent HCI and deep learning is deeply applied in gesture recognition, speech recognition, emotion recognition, and intelligent robot direction. A wide variety of recognition methods were proposed in related research fields and verified by experiments. Compared with interactive methods without deep learning, high recognition accuracy was achieved. In Human-Machine Interfaces (HMIs) with voice support, context plays an important role in improving user interfaces. Whether it is voice search, mobile communication, or children's speech recognition, HCI combined with deep learning can maintain better robustness. The combination of convolutional neural networks and long short-term memory networks can greatly improve the accuracy and precision of action recognition. Therefore, in the future, the application field of HCI will involve more industries and greater prospects are expected.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] A Review on Human-Computer Interaction and Intelligent Robots
    Ren, Fuji
    Bao, Yanwei
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2020, 19 (01) : 5 - 47
  • [2] Intelligent Human-Computer Interaction Based on Surface EMG Gesture Recognition
    Qi, Jinxian
    Jiang, Guozhang
    Li, Gongfa
    Sun, Ying
    Tao, Bo
    IEEE ACCESS, 2019, 7 : 61378 - 61387
  • [3] Hand Gesture Control for Human-Computer Interaction with Deep Learning
    Chua, S. N. David
    Chin, K. Y. Richard
    Lim, S. F.
    Jain, Pushpdant
    JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2022, 17 (03) : 1961 - 1970
  • [4] RESEARCH ON INTELLIGENT HUMAN-COMPUTER INTERACTION TECHNOLOGY OF TOBACCO SYSTEM
    Liu Kai
    Yang Yutao
    Zhao Min
    Zhang Liang
    Yang Yaojing
    Yan Dazhu
    He Weiyang
    Tian Wenong
    2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
  • [5] Multistage Deep Transfer Learning for EmIoT-Enabled Human-Computer Interaction
    Liu, Rui
    Liu, Qi
    Zhu, Hongxu
    Cao, Hui
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (16) : 15128 - 15137
  • [6] Smart interfaces for human-computer intelligent interaction
    Yven, J
    Wechsler, H
    CCA 2003: PROCEEDINGS OF 2003 IEEE CONFERENCE ON CONTROL APPLICATIONS, VOLS 1 AND 2, 2003, : 1192 - 1197
  • [7] Human-Computer Interaction in Intelligent Tutoring Systems
    Toala, Ramon
    Duraes, Dalila
    Novais, Paulo
    DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 16TH INTERNATIONAL CONFERENCE, 2020, 1003 : 52 - 59
  • [8] A Survey of Human-Computer Interaction Technology in Intelligent Home for Children’s Learning
    Li M.
    Zha S.
    Gong W.
    Jia Y.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (02): : 248 - 261
  • [9] Human-Computer Interaction in Currency Exchange
    Rivas, Alberto
    Martin-Limorti, Javier J.
    Chamoso, Pablo
    Gonzalez-Briones, Alfonso
    De La Prieta, Fernando
    Rodriguez, Sara
    KNOWLEDGE MANAGEMENT IN ORGANIZATIONS, KMO 2018, 2018, 877 : 390 - 400
  • [10] A Multimodal Human-Computer Interaction for Smart Learning System
    Alzubi, Tareq Mahmod
    Alzubi, Jafar A.
    Singh, Ashish
    Alzubi, Omar A.
    Subramanian, Murali
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2025, 41 (03) : 1718 - 1728