The unreasonable effectiveness of deep learning in artificial intelligence

Cited by: 191
Author
Sejnowski, Terrence J. [1, 2]
Affiliations
[1] Salk Inst Biol Studies, Computat Neurobiol Lab, La Jolla, CA 92037 USA
[2] Univ Calif San Diego, Div Biol Sci, La Jolla, CA 92093 USA
Keywords
deep learning; artificial intelligence; neural networks; ALGORITHM; MEMORY; BACK;
DOI
10.1073/pnas.1907373117
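Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract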
Deep learning networks have been trained to recognize speech, caption photographs, and translate text between languages at high levels of performance. Although applications of deep learning networks to real-world problems have become ubiquitous, our understanding of why they are so effective is lacking. These empirical results should not be possible according to sample complexity in statistics and nonconvex optimization theory. However, paradoxes in the training and effectiveness of deep learning networks are being investigated and insights are being found in the geometry of high-dimensional spaces. A mathematical theory of deep learning would illuminate how they function, allow us to assess the strengths and weaknesses of different network architectures, and lead to major improvements. Deep learning has provided natural ways for humans to communicate with digital devices and is foundational for building artificial general intelligence. Deep learning was inspired by the architecture of the cerebral cortex and insights into autonomy and general intelligence may be found in other brain regions that are essential for planning and survival, but major breakthroughs will be needed to achieve these goals.
Pages: 30033-30038
Page count: 6