Semi-supervised support vector regression based on self-training with label uncertainty: An application to virtual metrology in semiconductor manufacturing

Cited by: 79
Authors
Kang, Pilsung [1]
Kim, Dongil [2]
Cho, Sungzoon [3]
Affiliations
[1] Korea Univ, Sch Ind Management Engn, Seoul 02841, South Korea
[2] Korea Inst Ind Technol, Smart Mfg Technol Grp, Cheonan 31056, South Korea
[3] Seoul Natl Univ, Dept Ind Engn, Seoul 08826, South Korea
Funding
National Research Foundation, Singapore;
Keywords
Semi-supervised learning; Support vector regression; Probabilistic local reconstruction; Data generation; Virtual metrology; Semiconductor manufacturing; RECONSTRUCTION; SVM;
DOI
10.1016/j.eswa.2015.12.027
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Datasets continue to grow, and data are being collected from numerous applications. Because collecting labeled data is expensive and time-consuming, the amount of unlabeled data is increasing. Semi-supervised learning (SSL) has been proposed to improve conventional supervised learning methods by training on both unlabeled and labeled data. In contrast to classification problems, estimating labels for unlabeled data introduces added uncertainty in regression problems. In this paper, a semi-supervised support vector regression (SS-SVR) method based on self-training is proposed. The proposed method addresses the uncertainty of the estimated labels for unlabeled data. To measure labeling uncertainty, the label distribution of the unlabeled data is estimated with two probabilistic local reconstruction (PLR) models. Then, the training data are generated by oversampling from the unlabeled data and their estimated label distribution. The sampling rate varies according to the uncertainty. Finally, expected margin-based pattern selection (EMPS) is employed to reduce training complexity. We verify the proposed method with 30 regression datasets and a real-world problem: virtual metrology (VM) in semiconductor manufacturing. The experimental results show that the proposed method improves the accuracy by 8% compared with conventional supervised SVR, and the training time for the proposed method is 20% shorter than that of the benchmark methods. (C) 2015 Elsevier Ltd. All rights reserved.
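The data-generation step described in the abstract can be illustrated with a rough Python sketch. It is not the authors' implementation, and several parts are assumptions made for illustration: a k-nearest-neighbor mean and variance stands in for the two PLR models, the label distribution is taken to be Gaussian, and more confident points (lower variance) are assumed to be sampled more often, since the abstract does not state the direction of the relationship. The names `knn_label_distribution`, `generate_training_data`, `k`, and `max_copies` are hypothetical.

```python
import math
import random


def knn_label_distribution(x, labeled, k=3):
    """Estimate the label mean and variance at x from its k nearest labeled points.

    Assumption: a simple k-NN stand-in for the PLR models named in the abstract.
    """
    neighbors = sorted(labeled, key=lambda pair: math.dist(x, pair[0]))[:k]
    ys = [y for _, y in neighbors]
    mean = sum(ys) / len(ys)
    var = sum((y - mean) ** 2 for y in ys) / len(ys)
    return mean, var


def generate_training_data(labeled, unlabeled, k=3, max_copies=5, seed=0):
    """Oversample unlabeled points, drawing labels from their estimated distribution."""
    rng = random.Random(seed)
    stats = [knn_label_distribution(x, labeled, k) for x in unlabeled]
    # Fall back to 1.0 when every variance is zero, to avoid dividing by zero.
    max_var = max((var for _, var in stats), default=0.0) or 1.0
    generated = []
    for x, (mean, var) in zip(unlabeled, stats):
        # Assumption: lower uncertainty gives a higher sampling rate.
        confidence = 1.0 - var / max_var
        n_copies = 1 + round(confidence * (max_copies - 1))
        for _ in range(n_copies):
            # Assumption: labels are drawn from a Gaussian with the estimated moments.
            generated.append((x, rng.gauss(mean, math.sqrt(var))))
    return generated
```

In a full self-training pipeline, the generated pairs would be combined with the labeled data and passed to an SVR learner, optionally after a pattern-selection step such as EMPS; neither of those stages is shown here.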
Pages: 85-106
Page count: 22
Related papers
50 in total
  • [1] Self-training with dual uncertainty for semi-supervised MRI image segmentation
    Qiu, Zhanhong
    Gan, Haitao
    Shi, Ming
    Huang, Zhongwei
    Yang, Zhi
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 94
  • [2] ST-LP: self-training and label propagation for semi-supervised classification
    Chih-Wen Lin
    Chen-Kuo Chiang
    Yu-An Wang
    Yue-Lin Yang
    Hao-Ting Li
    Tzu-Chieh Lin
    Multimedia Tools and Applications, 2024, 83 (41) : 89335 - 89353
  • [3] A self-training semi-supervised support vector machine algorithm and its applications in brain computer interface
    Li, Yuanqing
    Li, Huiqi
    Guan, Cuntai
    Chin, Zhengyang
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 385 - 388
  • [4] A self-training hierarchical prototype-based approach for semi-supervised classification
    Gu, Xiaowei
    INFORMATION SCIENCES, 2020, 535 : 204 - 224
  • [5] Semi-supervised self-training for decision tree classifiers
    Tanha, Jafar
    van Someren, Maarten
    Afsarmanesh, Hamideh
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2017, 8 (01) : 355 - 370
  • [6] Fast semi-supervised self-training algorithm based on data editing
    Li, Bing
    Wang, Jikui
    Yang, Zhengguo
    Yi, Jihai
    Nie, Feiping
    INFORMATION SCIENCES, 2023, 626 : 293 - 314
  • [7] Semi-supervised Continual Learning with Meta Self-training
    Ho, Stella
    Liu, Ming
    Du, Lan
    Li, Yunfeng
    Gao, Longxiang
    Gao, Shang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4024 - 4028
  • [8] Semi-supervised self-training for decision tree classifiers
    Jafar Tanha
    Maarten van Someren
    Hamideh Afsarmanesh
    International Journal of Machine Learning and Cybernetics, 2017, 8 : 355 - 370
  • [9] SEMI-SUPERVISED FACE RECOGNITION WITH LDA SELF-TRAINING
    Zhao, Xuran
    Evans, Nicholas
    Dugelay, Jean-Luc
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [10] Federated Self-training for Semi-supervised Audio Recognition
    Tsouvalas, Vasileios
    Saeed, Aaqib
    Ozcelebi, Tanir
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (06)