Unsupervised Post-Tuning of Deep Neural Networks

Cited by: 2
Authors
Cerisara, Christophe [1]
Caillon, Paul [1]
Le Berre, Guillaume [1]
Affiliations
[1] Univ Lorraine, CNRS, LORIA, F-54000 Nancy, France
Source
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2021
Keywords
deep learning; unsupervised training; regularization; natural language processing
DOI
10.1109/IJCNN52387.2021.9534198
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we propose a new unsupervised training procedure that is most effective when applied after supervised training and fine-tuning of deep neural network classifiers. Standard regularization techniques combat overfitting by means that are unrelated to the target classification loss, such as minimizing the L2 norm or adding noise to the data, the model, or the training process. In contrast, the proposed unsupervised training loss reduces overfitting by optimizing the true classifier risk. The proposed approach is evaluated on several tasks of increasing difficulty under varying conditions: unsupervised training, post-tuning, and anomaly detection. It is tested both on simple neural networks, such as a small multi-layer perceptron, and on complex Natural Language Processing models, e.g., pretrained BERT embeddings. Experimental results confirm the theory and show that the proposed approach gives the best results in post-tuning conditions, i.e., when applied after supervised training and fine-tuning.
Pages: 8