Semi-supervised support vector regression based on self-training with label uncertainty: An application to virtual metrology in semiconductor manufacturing

被引:79
作者
Kang, Pilsung [1 ]
Kim, Dongil [2 ]
Cho, Sungzoon [3 ]
机构
[1] Korea Univ, Sch Ind Management Engn, Seoul 02841, South Korea
[2] Korea Inst Ind Technol, Smart Mfg Technol Grp, Cheonan 31056, South Korea
[3] Seoul Natl Univ, Dept Ind Engn, Seoul 08826, South Korea
基金
新加坡国家研究基金会;
关键词
Semi-supervised learning; Support vector regression; Probabilistic local reconstruction; Data generation; Virtual metrology; Semiconductor manufacturing; RECONSTRUCTION; SVM;
D O I
10.1016/j.eswa.2015.12.027
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dataset size continues to increase and data are being collected from numerous applications. Because collecting labeled data is expensive and time consuming, the amount of unlabeled data is increasing. Semi-supervised learning (SSL) has been proposed to improve conventional supervised learning methods by training from both unlabeled and labeled data. In contrast to classification problems, the estimation of labels for unlabeled data presents added uncertainty for regression problems. In this paper, a semi supervised support vector regression (SS-SVR) method based on self-training is proposed. The proposed method addresses the uncertainty of the estimated labels for unlabeled data. To measure labeling uncertainty, the label distribution of the unlabeled data is estimated with two probabilistic local reconstruction (PLR) models. Then, the training data are generated by oversampling from the unlabeled data and their estimated label distribution. The sampling rate is different based on uncertainty. Finally, expected margin-based pattern selection (EMPS) is employed to reduce training complexity. We verify the proposed method with 30 regression datasets and a real-world problem: virtual metrology (VM) in semiconductor manufacturing. The experiment results show that the proposed method improves the accuracy by 8% compared with conventional supervised SVR, and the training time for the proposed method is 20% shorter than that of the benchmark methods. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:85 / 106
页数:22
相关论文
共 50 条
  • [31] SST: Self-training with self-adaptive thresholding for semi-supervised learning
    Zhao, Shuai
    Huang, Heyan
    Li, Xinge
    Chen, Xiaokang
    Wang, Rui
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (05)
  • [32] Automatic Adjustment of Confidence Values in Self-training Semi-supervised Method
    Ovidio Vale, Karliane M.
    Canuto, Anne Magaly de P.
    Santos, Araken de Medeiros
    Gorgonio, Flavius da Luz e
    Tavares, Alan de M.
    Gorgnio, Arthur C.
    Alves, Cainan T.
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [33] Semi-Supervised Gastrointestinal Stromal Tumor Detection via Self-Training
    Yang, Qi
    Cao, Ziran
    Jiang, Yaling
    Sun, Hanbo
    Gu, Xiaokang
    Xie, Fei
    Miao, Fei
    Gao, Gang
    ELECTRONICS, 2023, 12 (04)
  • [34] Semi-supervised support vector regression based on data similarity and its application to rock-mechanics parameters estimation
    Chen, Xi
    Cao, Weihua
    Gan, Chao
    Ohyama, Yasuhiro
    She, Jinhua
    Wu, Min
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 104
  • [35] Semi-Supervised Learning for Fine-Grained Classification With Self-Training
    Nartey, Obed Tettey
    Yang, Guowu
    Wu, Jinzhao
    Asare, Sarpong Kwadwo
    IEEE ACCESS, 2020, 8 : 2109 - 2121
  • [36] Local Binary Ensemble based Self-training for Semi-supervised Classification of Hyperspectral Remote Sensing Images
    Singh, Pangambam Sendash
    Singh, Vijendra Pratap
    Pandey, Manish Kumar
    Karthikeyan, Subbiah
    COMPUTACION Y SISTEMAS, 2020, 24 (02): : 497 - 509
  • [37] Self-training involving semantic-space finetuning for semi-supervised multi-label document classification
    Zhewei Xu
    Mizuho Iwaihara
    International Journal on Digital Libraries, 2024, 25 : 25 - 39
  • [38] Self-training involving semantic-space finetuning for semi-supervised multi-label document classification
    Xu, Zhewei
    Iwaihara, Mizuho
    INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2024, 25 (01) : 25 - 39
  • [39] Integrating Semantic-Space Finetuning and Self-Training for Semi-Supervised Multi-label Text Classification
    Xu, Zhewei
    Iwaihara, Mizuho
    TOWARDS OPEN AND TRUSTWORTHY DIGITAL SOCIETIES, ICADL 2021, 2021, 13133 : 249 - 263
  • [40] A CNN-Based Semi-supervised Self-training Method for Robust Underwater Fish Recognition
    Li, Tanqing
    Zhao, Zhili
    Zhang, Hengyu
    Li, Kun
    Lv, Wenjun
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING, EITCE 2023, 2023, : 1553 - 1559