Semi-supervised support vector regression based on self-training with label uncertainty: An application to virtual metrology in semiconductor manufacturing

被引:79
作者
Kang, Pilsung [1 ]
Kim, Dongil [2 ]
Cho, Sungzoon [3 ]
机构
[1] Korea Univ, Sch Ind Management Engn, Seoul 02841, South Korea
[2] Korea Inst Ind Technol, Smart Mfg Technol Grp, Cheonan 31056, South Korea
[3] Seoul Natl Univ, Dept Ind Engn, Seoul 08826, South Korea
基金
新加坡国家研究基金会;
关键词
Semi-supervised learning; Support vector regression; Probabilistic local reconstruction; Data generation; Virtual metrology; Semiconductor manufacturing; RECONSTRUCTION; SVM;
D O I
10.1016/j.eswa.2015.12.027
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dataset size continues to increase and data are being collected from numerous applications. Because collecting labeled data is expensive and time consuming, the amount of unlabeled data is increasing. Semi-supervised learning (SSL) has been proposed to improve conventional supervised learning methods by training from both unlabeled and labeled data. In contrast to classification problems, the estimation of labels for unlabeled data presents added uncertainty for regression problems. In this paper, a semi supervised support vector regression (SS-SVR) method based on self-training is proposed. The proposed method addresses the uncertainty of the estimated labels for unlabeled data. To measure labeling uncertainty, the label distribution of the unlabeled data is estimated with two probabilistic local reconstruction (PLR) models. Then, the training data are generated by oversampling from the unlabeled data and their estimated label distribution. The sampling rate is different based on uncertainty. Finally, expected margin-based pattern selection (EMPS) is employed to reduce training complexity. We verify the proposed method with 30 regression datasets and a real-world problem: virtual metrology (VM) in semiconductor manufacturing. The experiment results show that the proposed method improves the accuracy by 8% compared with conventional supervised SVR, and the training time for the proposed method is 20% shorter than that of the benchmark methods. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:85 / 106
页数:22
相关论文
共 50 条
  • [21] Self-Training using Selection Network for Semi-supervised Learning
    Jeong, Jisoo
    Lee, Seungeui
    Kwak, Nojun
    ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 23 - 32
  • [22] An Auto-Adjustable Semi-Supervised Self-Training Algorithm
    Livieris, Ioannis E.
    Kanavos, Andreas
    Tampakas, Vassilis
    Pintelas, Panagiotis
    ALGORITHMS, 2018, 11 (09):
  • [23] Semi-Supervised Self-Training Method Based on an Optimum-Path Forest
    Li, Junnan
    Zhu, Qingsheng
    IEEE ACCESS, 2019, 7 : 36388 - 36399
  • [24] A semi-supervised self-training method based on density peaks and natural neighbors
    Zhao, Suwen
    Li, Junnan
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (02) : 2939 - 2953
  • [25] A semi-supervised self-training method based on density peaks and natural neighbors
    Suwen Zhao
    Junnan Li
    Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 2939 - 2953
  • [26] Graph-Based Self-Training for Semi-Supervised Deep Similarity Learning
    Wang, Yifan
    Huang, Yan
    Wang, Qicong
    Zhao, Chong
    Zhang, Zhenchang
    Chen, Jian
    SENSORS, 2023, 23 (08)
  • [27] Granulation-based self-training for the semi-supervised classification of remote-sensing images
    Aydav, Prem Shankar Singh
    Minz, Sonajharia
    GRANULAR COMPUTING, 2020, 5 (03) : 309 - 327
  • [28] On the characterization of noise filters for self-training semi-supervised in nearest neighbor classification
    Triguero, Isaac
    Saez, Jose A.
    Luengo, Julian
    Garcia, Salvador
    Herrera, Francisco
    NEUROCOMPUTING, 2014, 132 : 30 - 41
  • [29] Analysis of training data using clustering to improve semi-supervised self-training
    Piroonsup, N.
    Sinthupinyo, S.
    KNOWLEDGE-BASED SYSTEMS, 2018, 143 : 65 - 80
  • [30] Classwise Self-Paced Self-Training for Semi-Supervised Image Classification
    Lu, Cheng-Yu
    Hsu, Heng-Cheng
    Chiang, Chen-Kuo
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 753 - 758