Mining the breast cancer pattern using artificial neural networks and multivariate adaptive regression splines

被引:129
作者
Chou, SM
Lee, TS
Shao, YE
Chen, IF
机构
[1] Fu Jen Catholic Univ, Grad Inst Management, Taipei 24205, Taiwan
[2] Chang Jung Christian Univ, Dept Nursing, Tainan, Taiwan
[3] Fu Jen Catholic Univ, Dept Stat & Informat Sci, Taipei, Taiwan
关键词
data mining; breast cancer; classification; neural networks; multivariate adaptive regression splines;
D O I
10.1016/j.eswa.2003.12.013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data mining is a very popular technique and has been widely applied in different areas these days. The artificial neural network has become a very popular alternative in prediction and classification tasks due to its associated memory characteristics and generalization capability. However. the relative importance of potential input variables and the long training process have often been criticized and hence limited its application in handling classification problems. The objective of the proposed study is to explore the performance of data classification by integrating artificial neural networks with the multivariate adaptive regression splines (MARS) approach. The rationale under the analyses is firstly to use MARS in modeling the classification problem, then the obtained significant variables are used as the input variables of the designed neural networks model. To demonstrate the inclusion of the obtained important variables from MARS would improve the classification accuracy of the networks, diagnostic tasks are performed on one fine needle aspiration cytology breast cancer data set. As the results reveal. the proposed integrated approach outperforms the results using discriminant analysis, artificial neural networks and multivariate adaptive regression splines and hence provides an efficient alternative in handling breast cancer diagnostic problems. (C) 2003 Elsevier Ltd. All rights reserved.
引用
收藏
页码:133 / 142
页数:10
相关论文
共 62 条
  • [11] Using neural networks for data mining
    Craven, MW
    Shavlik, JW
    [J]. FUTURE GENERATION COMPUTER SYSTEMS, 1997, 13 (2-3) : 211 - 229
  • [12] SMOOTHING NOISY DATA WITH SPLINE FUNCTIONS
    WAHBA, G
    [J]. NUMERISCHE MATHEMATIK, 1975, 24 (05) : 383 - 393
  • [13] CURT H, 1995, INTELLIGENT SOFTWARE, P1
  • [14] Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
  • [15] Davies P. C., 1994, NeurovestJ, V5, P21
  • [16] Forecasting exchange rates using TSMARS
    De Gooijer, JG
    Ray, BK
    Krager, H
    [J]. JOURNAL OF INTERNATIONAL MONEY AND FINANCE, 1998, 17 (03) : 513 - 534
  • [17] A comparison of neural networks and linear scoring models in the credit union environment
    Desai, VS
    Crook, JN
    Overstreet, GA
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1996, 95 (01) : 24 - 37
  • [18] Dillon W.R., 1984, MULTIVARIATE ANAL ME
  • [19] VARIABILITY IN RADIOLOGISTS INTERPRETATIONS OF MAMMOGRAMS
    ELMORE, JG
    WELLS, CK
    LEE, CH
    HOWARD, DH
    FEINSTEIN, AR
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 1994, 331 (22) : 1493 - 1499
  • [20] The KDD process for extracting useful knowledge from volumes of data
    Fayyad, U
    PiatetskyShapiro, G
    Smyth, P
    [J]. COMMUNICATIONS OF THE ACM, 1996, 39 (11) : 27 - 34