An Analytic End-to-End Collaborative Deep Learning Algorithm

被引：2

作者：

Li, Sitan ^{[1
]}

Cheah, Chien Chern ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Nanyang Ave, Singapore 639798, Singapore

来源：

IEEE CONTROL SYSTEMS LETTERS | 2023年 / 7卷

关键词：

Deep learning; sigmoid; robot kinematics; THEORETICAL FRAMEWORK;

D O I：

10.1109/LCSYS.2023.3292034

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In most control applications, theoretical analysis of the systems is crucial in ensuring stability or convergence, so as to ensure safe and reliable operations and also to gain a better understanding of the systems for further developments. However, most current deep learning methods are black-box approaches that are more focused on empirical studies. Recently, some results have been obtained for convergence analysis of end-to end deep learning based on non-smooth ReLU activation functions, which may result in chattering for control tasks. This letter presents a convergence analysis for end-to-end deep learning of fully connected neural networks (FNN) with smooth activation functions. The proposed method therefore avoids any potential chattering problem, and it also does not easily lead to gradient vanishing problems. The proposed End-to-End algorithm trains multiple two-layer fully connected networks concurrently and collaborative learning is used to further combine their strengths to improve accuracy. A classification case study based on fully connected networks and MNIST dataset is presented to demonstrate the performance of the proposed approach. In addition, an online kinematics control task of a UR5e robot arm is formulated to illustrate the regression approximation and online updating ability of the proposed algorithm.

引用

页码：3024 / 3029

页数：6

共 13 条

[1]

Allen-Zhu Z, 2019, PR MACH LEARN RES, V97

[2]

Du SS, 2019, 36 INT C MACHINE LEA, V97

[3] An analytic layer-wise deep learning framework with applications to robotics [J].

Huu-Thiet Nguyen ;

Chien Chern Cheah ;

Kar-Ann Toh .

AUTOMATICA, 2022, 135

[4] A Layer-Wise Theoretical Framework for Deep Learning of Convolutional Neural Networks [J].

Huu-Thiet Nguyen ;

Li, Sitan ;

Cheah, Chien Chern .

IEEE ACCESS, 2022, 10 :14270-14287

[5] Deep learning [J].

LeCun, Yann ;

Bengio, Yoshua ;

Hinton, Geoffrey .

NATURE, 2015, 521 (7553) :436-444

[6]

Li ST, 2023, Arxiv, DOI arXiv:2305.18594

[7] A Theoretical Framework for End-to-End Learning of Deep Neural Networks With Applications to Robotics [J].

Li, Sitan ;

Nguyen, Huu-Thiet ;

Cheah, Chien Chern .

IEEE ACCESS, 2023, 11 :21992-22006

[8] Lyapunov-Derived Control and Adaptive Update Laws for Inner and Outer Layer Weights of a Deep Neural Network [J].

Patil, Omkar Sudhir ;

Le, Duc M. ;

Greene, Max L. ;

Dixon, Warren E. .

IEEE CONTROL SYSTEMS LETTERS, 2022, 6 :1855-1860

[9] Effects of depth, width, and initialization: A convergence analysis of layer-wise training for deep linear neural networks [J].

Shin, Yeonjong .

ANALYSIS AND APPLICATIONS, 2022, 20 (01) :73-119

[10] Review of Deep Learning Algorithms and Architectures [J].

Shrestha, Ajay ;

Mahmood, Ausif .

IEEE ACCESS, 2019, 7 :53040-53065

← 1 2 →