Next-generation deep learning based on simulators and synthetic data

被引:81
作者
de Melo, Celso M. [1 ]
Torralba, Antonio [2 ]
Guibas, Leonidas [3 ]
DiCarlo, James [4 ]
Chellappa, Rama [5 ]
Hodgins, Jessica [6 ]
机构
[1] DEVCOM US Army Res Lab, Computat & Informat Sci Directorate, Playa Vista, CA 92101 USA
[2] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
[3] Stanford Univ, Comp Sci Dept, Stanford, CA 94305 USA
[4] MIT, Dept Brain & Cognit Sci, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[5] Johns Hopkins Univ, Elect & Comp Engn, Baltimore, MD USA
[6] Carnegie Mellon Univ, Comp Sci Dept, Pittsburgh, PA 15213 USA
关键词
MODELS;
D O I
10.1016/j.tics.2021.11.008
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
Deep learning (DL) is being successfully applied across multiple domains, yet these models learn in a most artificial way: they require large quantities of labeled data to grasp even simple concepts. Thus, the main bottleneck is often access to supervised data. Here, we highlight a trend in a potential solution to this challenge: synthetic data. Synthetic data are becoming accessible due to progress in rendering pipelines, generative adversarial models, and fusion models. Moreover, advancements in domain adaptation techniques help close the statistical gap between synthetic and real data. Paradoxically, this artificial solution is also likely to enable more natural learning, as seen in biological systems, including continual, multimodal, and embodied learning. Complementary to this, simulators and deep neural networks (DNNs) will also have a critical role in providing insight into the cognitive and neural functioning of biological systems. We also review the strengths of, and opportunities and novel challenges associated with, synthetic data.
引用
收藏
页码:174 / 187
页数:14
相关论文
共 128 条
[1]   Augmented Reality Meets Computer Vision: Efficient Data Generation for Urban Driving Scenes [J].
Abu Alhaija, Hassan ;
Mustikovela, Siva Karthik ;
Mescheder, Lars ;
Geiger, Andreas ;
Rother, Carsten .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2018, 126 (09) :961-972
[2]   Automatic Target Recognition of Aerial Vehicles Based on Synthetic SAR Imagery Using Hybrid Stacked Denoising Auto-encoders [J].
Ahmadibeni, Ali ;
Jones, Branddon ;
Borooshak, Leila ;
Shirkhodaie, Amir .
ALGORITHMS FOR SYNTHETIC APERTURE RADAR IMAGERY XXVII, 2020, 11393
[3]  
Akkaya I., 2019, Solving rubik's cube with a robot hand
[4]  
Aljundi R, 2019, ADV NEUR IN, V32
[5]  
Andrew GHoward., 2017, MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
[6]  
[Anonymous], 2020, ARXIV200704954
[7]   Look, Listen and Learn [J].
Arandjelovic, Relja ;
Zisserman, Andrew .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :609-617
[8]  
Azadi S, 2019, ARXIV191111357
[9]   Multimodal Machine Learning: A Survey and Taxonomy [J].
Baltrusaitis, Tadas ;
Ahuja, Chaitanya ;
Morency, Louis-Philippe .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (02) :423-443
[10]   Unsupervised Learning [J].
Barlow, H. B. .
NEURAL COMPUTATION, 1989, 1 (03) :295-311