A comprehensive survey on regularization strategies in machine learning

被引:118
|
作者
Tian, Yingjie [1 ,3 ,4 ]
Zhang, Yuqi [2 ,3 ,4 ]
机构
[1] Univ Chinese Acad Sci, Sch Econ & Management, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Math Sci, Beijing 100049, Peoples R China
[3] Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing 100190, Peoples R China
[4] Chinese Acad Sci, Key Lab Big Data Min & Knowledge Management, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Overfitting; Generalization; Regularization; Machine learning; COVARIANCE-MATRIX ESTIMATION; P-LAPLACIAN REGULARIZATION; FEATURE-SELECTION; SPARSE REGULARIZATION; VARIABLE SELECTION; NEURAL-NETWORKS; ROBUST PCA; IMAGE; REGRESSION; APPROXIMATION;
D O I
10.1016/j.inffus.2021.11.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In machine learning, the model is not as complicated as possible. Good generalization ability means that the model not only performs well on the training data set, but also can make good prediction on new data. Regularization imposes a penalty on model's complexity or smoothness, allowing for good generalization to unseen data even when training on a finite training set or with an inadequate iteration. Deep learning has developed rapidly in recent years. Then the regularization has a broader definition: regularization is a technology aimed at improving the generalization ability of a model. This paper gave a comprehensive study and a state-of-the-art review of the regularization strategies in machine learning. Then the characteristics and comparisons of regularizations were presented. In addition, it discussed how to choose a regularization for the specific task. For specific tasks, it is necessary for regularization technology to have good mathematical characteristics. Meanwhile, new regularization techniques can be constructed by extending and combining existing regularization techniques. Finally, it concluded current opportunities and challenges of regularization technologies, as well as many open concerns and research trends.
引用
收藏
页码:146 / 166
页数:21
相关论文
共 50 条
  • [41] A Comprehensive Survey on Machine Learning using in Software Defined Networks (SDN)
    Sahar Faezi
    Alireza Shirmarz
    Human-Centric Intelligent Systems, 2023, 3 (3): : 312 - 343
  • [42] A Comprehensive Survey on Training Acceleration for Large Machine Learning Models in IoT
    Wang, Haozhao
    Qu, Zhihao
    Zhou, Qihua
    Zhang, Haobo
    Luo, Boyuan
    Xu, Wenchao
    Guo, Song
    Li, Ruixuan
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (02) : 939 - 963
  • [43] A comprehensive survey on machine learning applications for drilling and blasting in surface mining
    Munagala, Venkat
    Thudumu, Srikanth
    Logothetis, Irini
    Bhandari, Sushil
    Vasa, Rajesh
    Mouzakis, Kon
    MACHINE LEARNING WITH APPLICATIONS, 2024, 15
  • [44] Adversarial Machine Learning for Network Intrusion Detection Systems: A Comprehensive Survey
    He, Ke
    Kim, Dan Dongseong
    Asghar, Muhammad Rizwan
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2023, 25 (01): : 538 - 566
  • [45] Machine Learning in Short-Reach Optical Systems: A Comprehensive Survey
    Shao, Chen
    Giacoumidis, Elias
    Billah, Syed Moktacim
    Li, Shi
    Li, Jialei
    Sahu, Prashasti
    Richter, Andre
    Faerber, Michael
    Kaefer, Tobias
    PHOTONICS, 2024, 11 (07)
  • [46] A Comprehensive Survey of Machine Learning Methodologies with Emphasis in Water Resources Management
    Drogkoula, Maria
    Kokkinos, Konstantinos
    Samaras, Nicholas
    APPLIED SCIENCES-BASEL, 2023, 13 (22):
  • [47] Leveraging Machine Learning and Big Data for Smart Buildings: A Comprehensive Survey
    Qolomany, Basheer
    Al-Fuqaha, Ala
    Gupta, Ajay
    Benhaddou, Driss
    Alwajidi, Safaa
    Qadir, Junaid
    Fong, Alvis C.
    IEEE ACCESS, 2019, 7 : 90316 - 90356
  • [48] Topologies in distributed machine learning: Comprehensive survey, recommendations and future directions
    Liu, Ling
    Zhou, Pan
    Sun, Gang
    Chen, Xi
    Wu, Tao
    Yu, Hongfang
    Guizani, Mohsen
    NEUROCOMPUTING, 2024, 567
  • [49] WiFi-Based Human Identification with Machine Learning: A Comprehensive Survey
    Mosharaf, Manal
    Kwak, Jae B.
    Choi, Wooyeol
    SENSORS, 2024, 24 (19)
  • [50] Comprehensive Survey of Using Machine Learning in the COVID-19 Pandemic
    El-Rashidy, Nora
    Abdelrazik, Samir
    Abuhmed, Tamer
    Amer, Eslam
    Ali, Farman
    Hu, Jong-Wan
    El-Sappagh, Shaker
    DIAGNOSTICS, 2021, 11 (07)