Minimum sample size determination of vibration signals in machine learning approach to fault diagnosis using power analysis

被引:20
作者
Indira, V. [2 ]
Vasanthakumari, R. [3 ]
Sugumaran, V. [1 ]
机构
[1] SRM Univ, Dept Mechatron Engn, Kattankulathur, India
[2] Sri Manakula Vinayagar Engn Coll, Dept Math, Madagadipet, Puducherry, India
[3] Villianur Coll Women, Dept Math, Villianur, Puducherry, India
关键词
Fault diagnosis; Machine learning; Power analysis; Vibration signals; Minimum sample size; Statistical features; LOGISTIC-REGRESSION; CLINICAL-TRIALS; COMPARING; EXPRESSION PATTERNS; NEURAL-NETWORKS; GEAR FAILURE; INTERIM DATA; REQUIREMENTS; ODDS; EQUIVALENCE;
D O I
10.1016/j.eswa.2010.06.068
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The machine learning approach to fault diagnosis consists of a chain of activities such as data acquisition, feature extraction, feature selection and classification. Each one is equally important in fault diagnosis. As machine learning is a soft science, there is a lot of scope for finding mathematical reasoning which otherwise researchers do it arbitrarily or heuristically. Minimum number of samples required to separate faulty conditions, with statistical stability is one such important factor. This paper provides a method for determination of minimum sample size using power analysis. A typical bearing fault diagnosis problem is taken as a case for illustration and the results are compared with that of entropy-based algorithm (J48) for determining minimum sample size. The results will serve as a guideline for researchers working in fault diagnosis area to choose appropriate sample size. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:8650 / 8658
页数:9
相关论文
共 71 条
[2]   INTERNAL PILOT-STUDIES FOR ESTIMATING SAMPLE-SIZE [J].
BIRKETT, MA ;
DAY, SJ .
STATISTICS IN MEDICINE, 1994, 13 (23-24) :2455-2463
[3]   ON THE USE OF A PILOT SAMPLE FOR SAMPLE-SIZE DETERMINATION [J].
BROWNE, RH .
STATISTICS IN MEDICINE, 1995, 14 (17) :1933-1940
[4]   Statistical methodology .1. Incorporating the prevalence of disease into the sample size calculation for sensitivity and specificity [J].
Buderer, NMF .
ACADEMIC EMERGENCY MEDICINE, 1996, 3 (09) :895-900
[5]   SAMPLE-SIZE AND POWER DETERMINATION FOR A BINARY OUTCOME AND AN ORDINAL EXPOSURE WHEN LOGISTIC-REGRESSION ANALYSIS IS PLANNED [J].
BULL, SB .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1993, 137 (06) :676-684
[6]   IMPROVED APPROXIMATE FORMULA FOR CALCULATING SAMPLE SIZES FOR COMPARING 2 BINOMIAL DISTRIBUTIONS [J].
CASAGRANDE, JT ;
PIKE, MC ;
SMITH, PG .
BIOMETRICS, 1978, 34 (03) :483-486
[7]   Application of wavelets and neural networks to diagnostic system development, 1, feature extraction [J].
Chen, BH ;
Wang, XZ ;
Yang, SH ;
McGreavy, C .
COMPUTERS & CHEMICAL ENGINEERING, 1999, 23 (07) :899-906
[8]  
Cohen J., 1988, Statistical power analysis for the behavioral sciences, VSecond
[9]   SAMPLE-SIZE ESTIMATION FOR COMPARING 2 OR MORE TREATMENT GROUPS IN CLINICAL-TRIALS [J].
DAY, SJ ;
GRAHAM, DF .
STATISTICS IN MEDICINE, 1991, 10 (01) :33-43
[10]   A GOODNESS-OF-FIT APPROACH TO INFERENCE PROCEDURES FOR THE KAPPA-STATISTIC - CONFIDENCE-INTERVAL CONSTRUCTION, SIGNIFICANCE-TESTING AND SAMPLE-SIZE ESTIMATION [J].
DONNER, A ;
ELIASZIW, M .
STATISTICS IN MEDICINE, 1992, 11 (11) :1511-1519