Multi-grade Deep Learning

Cited by: 0
Author
Xu, Yuesheng [1 ]
Affiliation
[1] Old Dominion Univ, Dept Math & Stat, Norfolk, VA 23529 USA
Funding
U.S. National Science Foundation;
Keywords
Deep learning; Deep neural network (DNN); Multi-grade deep learning (MGDL); EMPIRICAL MODE DECOMPOSITION; ONLINE GRADIENT-METHOD; DETERMINISTIC CONVERGENCE; NETWORK;
DOI
10.1007/s42967-024-00474-y
Chinese Library Classification (CLC)
O29 [Applied Mathematics];
Discipline classification code
070104;
Abstract
Deep learning requires solving a nonconvex optimization problem of a large size to learn a deep neural network (DNN). The current deep learning model is single-grade: it trains a DNN end-to-end by solving a single nonconvex optimization problem. When the number of layers of the neural network is large, it is computationally challenging to carry out such a task efficiently. The complexity of the task comes from learning all weight matrices and bias vectors from one single nonconvex optimization problem of a large size. Inspired by the human education process, which arranges learning in grades, we propose a multi-grade learning model: instead of solving one single optimization problem of a large size, we successively solve a number of optimization problems of small sizes, organized in grades, to learn a shallow neural network (a network having a few hidden layers) for each grade. Specifically, the current grade learns the leftover from the previous grade. In each grade, we learn a shallow neural network stacked on top of the neural network learned in the previous grades, whose parameters remain unchanged during training of the current and future grades. By dividing the task of learning a DNN into learning several shallow neural networks, one can alleviate the severity of the nonconvexity of the original optimization problem of a large size. When all grades of the learning are completed, the final neural network learned is a stair-shaped neural network, which is the superposition of the networks learned from all grades. Such a model enables us to learn a DNN much more effectively and efficiently. Moreover, multi-grade learning naturally leads to adaptive learning. We prove that, in the context of function approximation, if the neural network generated by a new grade is nontrivial, then the optimal error of the new grade is strictly smaller than the optimal error of the previous grade. Furthermore, we provide numerical examples which confirm that the proposed multi-grade model significantly outperforms the standard single-grade model and is much more robust to noise than the single-grade model. They include three proof-of-concept examples; classification on two benchmark data sets, MNIST and Fashion MNIST, with two noise rates, which is to find classifiers that are functions of 784 dimensions; as well as numerical solutions of the one-dimensional Helmholtz equation.
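The grade-by-grade procedure described in the abstract can be sketched in code. The following is a minimal illustrative sketch, assuming a PyTorch-style setup for regression-type data; the helper names (make_shallow_net, train_grade, multi_grade_fit), the two-hidden-layer architecture per grade, the Adam optimizer, and the mean-squared-error loss are assumptions chosen for illustration, not the paper's exact formulation. Each grade trains a small shallow network on the current residual (the "leftover" from earlier grades), its parameters are then frozen, and its hidden features become the input of the next grade, so the grades stack into a deep, stair-shaped composition whose prediction is the superposition of the grade outputs.

```python
import torch
import torch.nn as nn

def make_shallow_net(in_dim, hidden_dim, out_dim):
    # One grade: a shallow network with a few hidden layers.
    return nn.Sequential(
        nn.Linear(in_dim, hidden_dim), nn.ReLU(),
        nn.Linear(hidden_dim, hidden_dim), nn.ReLU(),
        nn.Linear(hidden_dim, out_dim),
    )

def train_grade(net, features, target, epochs=200, lr=1e-3):
    # Solve one small nonconvex problem: fit this grade's shallow net
    # to the current residual; earlier grades stay frozen.
    opt = torch.optim.Adam(net.parameters(), lr=lr)
    loss_fn = nn.MSELoss()
    for _ in range(epochs):
        opt.zero_grad()
        loss = loss_fn(net(features), target)
        loss.backward()
        opt.step()
    return net

def multi_grade_fit(x, y, num_grades=3, hidden_dim=64):
    grades = []                         # frozen shallow nets, one per grade
    features = x                        # input to the next grade's shallow net
    residual = y                        # leftover the next grade must learn
    prediction = torch.zeros_like(y)
    for _ in range(num_grades):
        net = make_shallow_net(features.shape[1], hidden_dim, y.shape[1])
        train_grade(net, features, residual)
        for p in net.parameters():      # freeze this grade's parameters
            p.requires_grad_(False)
        with torch.no_grad():
            grade_out = net(features)
            # Hidden features of this grade (final linear layer dropped)
            # feed the next grade, stacking the grades into a deep,
            # stair-shaped composition.
            features = net[:-1](features)
            prediction = prediction + grade_out
            residual = y - prediction
        grades.append(net)
    return grades, prediction
```

Here x and y are assumed to be 2-D float tensors. Each call to train_grade solves only a small nonconvex problem over a single grade's parameters, which is the source of the efficiency and robustness advantages claimed in the abstract.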
Pages: 52