A Vibrating Mechanism to Prevent Neural Networks from Overfitting

被引:13
作者
Xiong, Jian [1 ]
Zhang, Kai [1 ]
Zhang, Hao [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
来源
2019 15TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC) | 2019年
关键词
D O I
10.1109/iwcmc.2019.8766500
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a Vibrating Mechanism, which can be understood as a continuous version of Dropout and can achieve a certain L1 regularization effect at the same time. In Dropout, the parameters are discarded with a probability obeying Bernoulli distribution. While in the proposed Vibrating Mechanism, the parameters are sampled from a truncated Gaussian distribution or uniform distribution. The mean of the distribution is the result trained by the previous iteration, and the standard deviation of the distribution is the absolute value of the previous iteration training result multiplied by a proportional coefficient. Besides, the performance improvement is theoretically analyzed from the perspective of hyperplane segmentation. The effectiveness of the proposed Vibrating Mechanism is demonstrated by applying it for a text classification task. We choose AG dataset for test. The result shows that the Vibrating Mechanism can achieve better classification accuracy without using L1 regularization or Dropout, which verifies the performance improvement of the Vibrating Mechanism.
引用
收藏
页码:1737 / 1742
页数:6
相关论文
共 19 条
  • [1] [Anonymous], CORR
  • [2] [Anonymous], 2016 C N AM ASS COMP
  • [3] [Anonymous], INT JOINT C ART INT
  • [4] [Anonymous], 2014, CONVOLUTIONAL NEURAL
  • [5] [Anonymous], 2015, ADV NEURAL INFORM PR
  • [6] [Anonymous], 2016, CoRR
  • [7] Collobert R, 2011, J MACH LEARN RES, V12, P2493
  • [8] Conneau A., 2016, VERY DEEP CONVOLUTIO
  • [9] MINIMIZING MULTIMODAL FUNCTIONS OF CONTINUOUS-VARIABLES WITH THE SIMULATED ANNEALING ALGORITHM
    CORANA, A
    MARCHESI, M
    MARTINI, C
    RIDELLA, S
    [J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1987, 13 (03): : 262 - 280
  • [10] Grave E., 2017, P 15 C EUROPEAN CHAP