Reinforcement learning for facilitating human-robot-interaction in manufacturing

被引:75
作者
Oliff, Harley [1 ]
Liu, Ying [1 ]
Kumar, Maneesh [2 ]
Williams, Michael [3 ]
Ryan, Michael [1 ]
机构
[1] Cardiff Univ, Sch Engn, Inst Mech & Mfg Engn, Cardiff CF24 3AA, Wales
[2] Cardiff Univ, Cardiff Business Sch, Cardiff CF10 3EU, Wales
[3] Olympus Surg Technol Europe, Cardiff, Wales
关键词
Intelligent manufacturing; Reinforcement learning; Human-robot interaction; Human factors; Adaptability; CYBER-PHYSICAL SYSTEMS; INDUSTRY; 4.0; ARCHITECTURE; FUTURE; COLLABORATION; INTELLIGENCE; FATIGUE;
D O I
10.1016/j.jmsy.2020.06.018
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
For many contemporary manufacturing processes, autonomous robotic operators have become ubiquitous. Despite this, the number of human operators within these processes remains high, and as a consequence, the number of interactions between humans and robots has increased in this context. This is a problem, as human beings introduce a source of disturbance and unpredictability into these processes in the form of performance variation. Despite the natural human aptitude for flexibility, their presence remains a source of disturbance within the system and make modelling and optimization of these systems considerably more challenging, and in many cases impossible. Improving the ability of robotic operators to adapt their behaviour to variations in human task performance is, therefore, a significant challenge to be overcome to enable many ideas in the larger intelligent manufacturing paradigm to be realised. This work presents the development of a methodology to effectively model these systems and a reinforcement learning agent capable of autonomous decision-making. This decision-making provides the robotic operators with greater adaptability, by enabling its behaviour to change based on observed information, both of its environment and human colleagues. The work extends theoretical knowledge on how learning methods can be implemented for robotic control, and how the capabilities that they enable may be leveraged to improve the interaction between robots and their human counterparts. The work further presents a novel methodology for the implementation of a reinforcement learning-based intelligent agent which enables a change in behavioural policy in robotic operators in response to performance variation in their human colleagues. The development and evaluation are supported by a generalized simulation model, which is parameterized to enable appropriate variation in human performance. The evaluation demonstrates that the reinforcement agent can effectively learn to make adjustments to its behaviour based on the knowledge extracted from observed information, and balance the task demands to optimise these adjustments.
引用
收藏
页码:326 / 340
页数:15
相关论文
共 93 条
  • [61] Cyber-physical production systems: Roots, expectations and R&D challenges
    Monostori, Laszlo
    [J]. VARIETY MANAGEMENT IN MANUFACTURING: PROCEEDINGS OF THE 47TH CIRP CONFERENCE ON MANUFACTURING SYSTEMS, 2014, 17 : 9 - 13
  • [62] An autonomous manufacturing system based on swarm of cognitive agents
    Park, Hong-Seok
    Tran, Ngoc-Hien
    [J]. JOURNAL OF MANUFACTURING SYSTEMS, 2012, 31 (03) : 337 - 348
  • [63] Self-optimizing production systems
    Permin, Eike
    Bertelsmeier, Felix
    Blum, Matthias
    Buetzler, Jennifer
    Haag, Sebastian
    Kuz, Sinem
    Oezdemir, Denis
    Stemmler, Sebastian
    Thombansen, Ulrich
    Schmitt, Robert
    Brecher, Christian
    Schlick, Christopher
    Abel, Dirk
    Poprawe, Reinhart
    Loosen, Peter
    Schulz, Wolfgang
    Schuh, Guenther
    [J]. RESEARCH AND INNOVATION IN MANUFACTURING: KEY ENABLING TECHNOLOGIES FOR THE FACTORIES OF THE FUTURE - PROCEEDINGS OF THE 48TH CIRP CONFERENCE ON MANUFACTURING SYSTEMS, 2016, 41 : 417 - 422
  • [64] Learning Physical Collaborative Robot Behaviors From Human Demonstrations
    Rozo, Leonel
    Calinon, Sylvain
    Caldwell, Darwin G.
    Jimenez, Pablo
    Torras, Carme
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2016, 32 (03) : 513 - 527
  • [65] Coronary Artery Disease From Isolated Non-H2-Determined Incompatibilities in Transplanted Mouse Hearts
    Russell, Paul S.
    Chase, Catharine M.
    Madsen, Joren C.
    Hirohashi, Tsutomu
    Cornell, Lynn D.
    Sproule, Thomas J.
    Colvin, Robert B.
    Roopenian, Derry C.
    [J]. TRANSPLANTATION, 2011, 91 (08) : 847 - 852
  • [66] RuSSmann Michael., 2015, Industry 4.0: The Future of Productivity and Growth in Manufacturing Industries, P14
  • [67] Sak H, 2014, INTERSPEECH, P338
  • [68] Salvucci DD, 2001, TRANSPORT RES REC, P9
  • [69] Deep learning in neural networks: An overview
    Schmidhuber, Juergen
    [J]. NEURAL NETWORKS, 2015, 61 : 85 - 117
  • [70] Schonsleben P, 2017, CIRP 50 C MAN SYST