How do humans acquire a meaningful understanding of the world with little to no supervision or semantic labels provided by the environment? Here we investigate embodiment with a closed loop between action and perception as one key component in this process. We take a close look at the representations learned by a deep reinforcement learning agent that is trained with high-dimensional visual observations collected in a 3D environment with very sparse rewards. We show that this agent learns stable representations of meaningful concepts such as doors without receiving any semantic labels. Our results show that the agent learns to represent the action relevant information, extracted from a simulated camera stream, in a wide variety of sparse activation patterns. The quality of the representations learned shows the strength of embodied learning and its advantages over fully supervised approaches. (C) 2020 The Authors. Published by Elsevier Ltd.
机构:
Queen Mary Univ London, Dept Elect Engn, Ctr Digital Mus, London E1 4NS, EnglandQueen Mary Univ London, Dept Elect Engn, Ctr Digital Mus, London E1 4NS, England
Plumbley, MD
Abdallah, SA
论文数: 0引用数: 0
h-index: 0
机构:
Queen Mary Univ London, Dept Elect Engn, Ctr Digital Mus, London E1 4NS, EnglandQueen Mary Univ London, Dept Elect Engn, Ctr Digital Mus, London E1 4NS, England
Abdallah, SA
Blumensath, T
论文数: 0引用数: 0
h-index: 0
机构:
Queen Mary Univ London, Dept Elect Engn, Ctr Digital Mus, London E1 4NS, EnglandQueen Mary Univ London, Dept Elect Engn, Ctr Digital Mus, London E1 4NS, England
Blumensath, T
Davies, ME
论文数: 0引用数: 0
h-index: 0
机构:
Queen Mary Univ London, Dept Elect Engn, Ctr Digital Mus, London E1 4NS, EnglandQueen Mary Univ London, Dept Elect Engn, Ctr Digital Mus, London E1 4NS, England
机构:
East China Normal Univ, MoE Engn Res Ctr Hardware Software Codesign Techno, Shanghai 200062, Peoples R ChinaEast China Normal Univ, MoE Engn Res Ctr Hardware Software Codesign Techno, Shanghai 200062, Peoples R China
Wei, Xian
Liu, Yingjie
论文数: 0引用数: 0
h-index: 0
机构:
East China Normal Univ, MoE Engn Res Ctr Hardware Software Codesign Techno, Shanghai 200062, Peoples R ChinaEast China Normal Univ, MoE Engn Res Ctr Hardware Software Codesign Techno, Shanghai 200062, Peoples R China
Liu, Yingjie
Tang, Xuan
论文数: 0引用数: 0
h-index: 0
机构:
East China Normal Univ, Sch Commun & Elect Engn, Shanghai 200241, Peoples R ChinaEast China Normal Univ, MoE Engn Res Ctr Hardware Software Codesign Techno, Shanghai 200062, Peoples R China
Tang, Xuan
Yu, Shui
论文数: 0引用数: 0
h-index: 0
机构:
Univ Technol Sydney, Sch Comp Sci, Sydney, NSW 2007, AustraliaEast China Normal Univ, MoE Engn Res Ctr Hardware Software Codesign Techno, Shanghai 200062, Peoples R China
Yu, Shui
Chen, Mingsong
论文数: 0引用数: 0
h-index: 0
机构:
East China Normal Univ, MoE Engn Res Ctr Hardware Software Codesign Techno, Shanghai 200062, Peoples R ChinaEast China Normal Univ, MoE Engn Res Ctr Hardware Software Codesign Techno, Shanghai 200062, Peoples R China