Data Dimension and Structure Effects in Predictive Performance of Deep Neural Networks

被引：2

作者：

Urda, Daniel ^{[1
]}

Jerez, Jose M. ^{[2
]}

Turias, Ignacio J. ^{[1
]}

机构：

[1] Univ Cadiz, Dept Comp Sci Engn, Cadiz, Spain

[2] Univ Malaga, Dept Comp Sci, Malaga, Spain

来源：

NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES (SOMET_18) | 2018年 / 303卷

关键词：

deep learning; prior knowledge; predictive modelling; constraints; inference;

D O I：

10.3233/978-1-61499-900-3-361

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning provides a variety of neural network based models, known as Deep Neural Networks (DNNs), which are being successfully used in several domains to build highly accurate predictors from data. In particular, the predictive performance of a dense fully-connected multi-layer neural networks may vary depending on some factors. In this paper, 18 synthetic datasets were used to test the effect of data dimension and data structure on the predictive performance of a standard DNN and an architecture-constrained DNN (c-DNN) based on problem specific information. The results of the analysis showed that a c-DNN clearly outperforms a standard DNN in most of the cases considered. Moreover, it suggested that both adding constraints to the network architecture and having the lowest number of input features possible which are relevant to the problem addressed may have a positive impact in terms of reducing overfitting and getting better prediction results.

引用

页码：361 / 372

页数：12

共 17 条

[1]

Amodei D, 2016, PR MACH LEARN RES, V48

[2] Deep learning for computational biology [J].

Angermueller, Christof ;

Parnamaa, Tanel ;

Parts, Leopold ;

Stegle, Oliver .

MOLECULAR SYSTEMS BIOLOGY, 2016, 12 (07)

[3]

[Anonymous], 2014, INFORM SCI, DOI DOI 10.1016/J.INS.2014.01.015

[4]

[Anonymous], 2017, NATURE, DOI DOI 10.1038/NATURE21056

[5]

[Anonymous], 2015, NATURE, DOI [10.1038/nature14539, 10.1038/, DOI 10.1038/NATURE14539]

[6] Scaling to very very large corpora for natural language disambiguation [J].

Banko, M ;

Brill, E .

39TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2001, :26-33

[7]

Chollet Francois, 2017, R interface to keras

[8] Regularization Paths for Generalized Linear Models via Coordinate Descent [J].

Friedman, Jerome ;

Hastie, Trevor ;

Tibshirani, Rob .

JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01) :1-22

[9] The Unreasonable Effectiveness of Data [J].

Halevy, Alon ;

Norvig, Peter ;

Pereira, Fernando .

IEEE INTELLIGENT SYSTEMS, 2009, 24 (02) :8-12

[10] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

← 1 2 →