Co-Clustering Structural Temporal Data with Applications to Semiconductor Manufacturing

被引:8
作者
Zhu, Yada [1 ,3 ]
He, Jingrui [2 ]
机构
[1] IBM Res, Yorktown Hts, NY 10598 USA
[2] Arizona State Univ, Comp Sci & Engn, 699 S Mill Ave, Tempe, AZ 85281 USA
[3] IBM TJ Watson Res Ctr, 1101 Kitchawan Rd, Yorktown Hts, NY 10598 USA
关键词
Co-clustering; semiconductor; structural; temporal; FAULT-DETECTION; TIME; PCA;
D O I
10.1145/2875427
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent years have witnessed data explosion in semiconductor manufacturing due to advances in instrumentation and storage techniques. The large amount of data associated with process variables monitored over time form a rich reservoir of information, which can be used for a variety of purposes, such as anomaly detection, quality control, and fault diagnostics. In particular, following the same recipe for a certain Integrated Circuit device, multiple tools and chambers can be deployed for the production of this device, during which multiple time series can be collected, such as temperature, impedance, gas flow, electric bias, etc. These time series naturally fit into a two-dimensional array (matrix), i.e., each element in this array corresponds to a time series for one process variable from one chamber. To leverage the rich structural information in such temporal data, in this article, we propose a novel framework named C-Struts to simultaneously cluster on the two dimensions of this array. In this framework, we interpret the structural information as a set of constraints on the cluster membership, introduce an auxiliary probability distribution accordingly, and design an iterative algorithm to assign each time series to a certain cluster on each dimension. Furthermore, we establish the equivalence between C-Struts and a generic optimization problem, which is able to accommodate various distance functions. Extensive experiments on synthetic, benchmark, as well as manufacturing datasets demonstrate the effectiveness of the proposed method.
引用
收藏
页数:18
相关论文
共 38 条
[1]  
[Anonymous], 2005, P 31 INT C VERY LARG
[2]  
[Anonymous], 2003, Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining
[3]  
[Anonymous], 2006, P 23 INT C MACHINE L, DOI DOI 10.1145/1143844.1143918
[4]  
[Anonymous], P ACM SIGKDD INT C K
[5]  
Banerjee A, 2007, J MACH LEARN RES, V8, P1919
[6]  
Chakrabarti D, 2004, LECT NOTES ARTIF INT, V3202, P112
[7]   Spatiotemporal Pattern Modeling for Fault Detection and Classification in Semiconductor Manufacturing [J].
Chang, Hyung Jin ;
Song, Dong Sung ;
Kim, Pyo Jae ;
Choi, Jin Young .
IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2012, 25 (01) :72-82
[8]  
Chen YP, 2013, 19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), P383
[9]  
Cho H, 2004, SIAM PROC S, P114
[10]   Scaling and time warping in time series querying [J].
Fu, Ada Wai-Chee ;
Keogh, Eamonn ;
Lau, Leo Yung Hang ;
Ratanamahatana, Chotirat Ann ;
Wong, Raymond Chi-Wing .
VLDB JOURNAL, 2008, 17 (04) :899-921