Real-time Motion Generation for Imaginary Creatures Using Hierarchical Reinforcement Learning

被引:0
|
作者
Ogaki, Keisuke [1 ]
Nakamura, Masayoshi [1 ]
机构
[1] DWANGO Co Ltd, Tokyo, Japan
来源
SIGGRAPH'18: ACM SIGGRAPH 2018 STUDIO | 2018年
关键词
Reinforcement Learning; Q-Learning; Neural Network;
D O I
10.1145/3214822.3214826
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Describing the motions of imaginary original creatures is an essential part of animations and computer games. One approach to generate such motions involves finding an optimal motion for approaching a goal by using the creatures' body and motor skills. Currently, researchers are employing deep reinforcement learning (DeepRL) to find such optimal motions. Some end-to-end DeepRL approaches learn the policy function, which outputs target pose for each joint according to the environment. In our study, we employed a hierarchical approach with a separate DeepRL decision maker and simple exploration-based sequence maker, and an action token, through which these two layers can communicate. By optimizing these two functions independently, we can achieve a light, fast-learning system available on mobile devices. In addition, we propose another technique to learn the policy at a faster pace with the help of a heuristic rule. By treating the heuristic rule as an additional action token, we can naturally incorporate it via Q-learning. The experimental results show that creatures can achieve better performance with the use of both heuristics and DeepRL than by using them independently.
引用
收藏
页数:2
相关论文
共 50 条
  • [31] Integration of Adaptive Control and Reinforcement Learning for Real-Time Control and Learning
    Annaswamy, Anuradha M.
    Guha, Anubhav
    Cui, Yingnan
    Tang, Sunbochen
    Fisher, Peter A.
    Gaudio, Joseph E.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (12) : 7740 - 7755
  • [32] Application of Reinforcement Learning for Real-Time Optimal Control of the Pellet Induration Process
    Jayasree Biswas
    Akash Goyal
    Balaji Selvanathan
    Sri Harsha Nistala
    Venkataramana Runkana
    Transactions of the Indian Institute of Metals, 2022, 75 : 2539 - 2546
  • [33] Application of Reinforcement Learning for Real-Time Optimal Control of the Pellet Induration Process
    Biswas, Jayasree
    Goyal, Akash
    Selvanathan, Balaji
    Nistala, Sri Harsha
    Runkana, Venkataramana
    TRANSACTIONS OF THE INDIAN INSTITUTE OF METALS, 2022, 75 (10) : 2539 - 2546
  • [34] Hierarchical Reinforcement Learning Approach for Motion Planning in Mobile Robotics
    Buitrago-Martinez, Andrea
    De la Rosa R, Fernando
    Lozano-Martinez, Fernando
    2013 IEEE LATIN AMERICAN ROBOTICS SYMPOSIUM (LARS 2013), 2013, : 83 - 88
  • [35] Reinforcement learning for optimizing real-time interventions and personalized feedback using wearable sensors
    Tripathy, Jyotsnarani
    Balasubramani, M.
    Rajan, V. Aravinda
    S, Vimalathithan
    Aeron, Anurag
    Arora, Meena
    Measurement: Sensors, 2024, 33
  • [36] Using real-time manufacturing data to schedule a smart factory via reinforcement learning
    Gu, Wenbin
    Li, Yuxin
    Tang, Dunbing
    Wang, Xianliang
    Yuan, Minghai
    COMPUTERS & INDUSTRIAL ENGINEERING, 2022, 171
  • [37] Potent Real-Time Recommendations Using Multi-Model Contextual Reinforcement Learning
    Kabra, Anubha
    Agarwal, Anu
    Parihar, Anil Singh
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, 9 (02): : 581 - 593
  • [38] Deep Reinforcement Learning for Sponsored Search Real-time Bidding
    Zhao, Jun
    Qiu, Guang
    Guan, Ziyu
    Zhao, Wei
    He, Xiaofei
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1021 - 1030
  • [39] Reinforcement Learning with Sequential Information Clustering in Real-Time Bidding
    Lu, Junwei
    Yang, Chaoqi
    Gao, Xiaofeng
    Wang, Liubin
    Li, Changcheng
    Chen, Guihai
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1633 - 1641
  • [40] Developing Real-Time Scheduling Policy by Deep Reinforcement Learning
    Bo, Zitong
    Qiao, Ying
    Leng, Chang
    Wang, Hongan
    Guo, Chaoping
    Zhang, Shaohui
    2021 IEEE 27TH REAL-TIME AND EMBEDDED TECHNOLOGY AND APPLICATIONS SYMPOSIUM (RTAS 2021), 2021, : 131 - 142