Online perceptual learning and natural language acquisition for autonomous robots

被引:6
作者
Alomari, Muhannad [1 ]
Li, Fangjun [1 ]
Hogg, David C. [1 ]
Cohn, Anthony G. [1 ,2 ,3 ,4 ]
机构
[1] Univ Leeds, Sch Comp, Leeds, W Yorkshire, England
[2] Qingdao Univ Sci & Technol, Sch Mech & Elect Engn, Luzhong Inst Safety Environm Protect Engeering &, Qingdao, Peoples R China
[3] Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China
[4] Shandong Univ, Sch Civil Engn, Jinan, Peoples R China
关键词
Language and vision; Language acquisition; Language grounding; Grammar induction; MODELS;
D O I
10.1016/j.artint.2021.103637
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, the problem of bootstrapping knowledge in language and vision for autonomous robots is addressed through novel techniques in grammar induction and word grounding to the perceptual world. In particular, we demonstrate a system, called OLAV, which is able, for the first time, to (1) learn to form discrete concepts from sensory data; (2) ground language (n-grams) to these concepts; (3) induce a grammar for the language being used to describe the perceptual world; and moreover to do all this incrementally, without storing all previous data. The learning is achieved in a loosely-supervised manner from raw linguistic and visual data. Moreover, the learnt model is transparent, rather than a black-box model and is thus open to human inspection. The visual data is collected using three different robotic platforms deployed in real-world and simulated environments and equipped with different sensing modalities, while the linguistic data is collected using online crowdsourcing tools and volunteers. The analysis performed on these robots demonstrates the effectiveness of the framework in learning visual concepts, language groundings and grammatical structure in these three online settings. (c) 2021 Published by Elsevier B.V.
引用
收藏
页数:32
相关论文
共 81 条
[11]  
[Anonymous], 1972, Complexity of Computer Computations
[12]  
[Anonymous], 1975, Graph Theory: An Algorithmic Approach
[13]  
[Anonymous], 2016, P IJCAI
[14]  
[Anonymous], 1996, Geographic objects with indeterminate boundaries
[15]  
ArkanCani O., 2019, ARXIV190413324
[16]   Driving Under the Influence (of Language) [J].
Barrett, Daniel Paul ;
Bronikowski, Scott Alan ;
Yu, Haonan ;
Siskind, Jeffrey Mark .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (07) :2668-2683
[17]  
Beetz Michael, 2011, 2011 11th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2011), P529, DOI 10.1109/Humanoids.2011.6100855
[18]   Defining Relations: A General Incremental Approach with Spatial and Temporal Case Studies [J].
Bennett, Brandon ;
Du, Heshan ;
Alvarez, Lucia Gomez ;
Cohn, Anthony G. .
FORMAL ONTOLOGY IN INFORMATION SYSTEMS, 2016, 283 :23-36
[19]  
Bisk Y, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P8718
[20]  
Burchfiel B, 2017, ROBOTICS: SCIENCE AND SYSTEMS XIII