Self-organizing map for cluster analysis of a breast cancer database

被引:79
作者
Markey, MK
Lo, JY
Tourassi, GD
Floyd, CE
机构
[1] Duke Univ, Dept Biomed Engn, Durham, NC 27708 USA
[2] Duke Univ, Med Ctr, Dept Radiol, Digital Imaging Res Div, Durham, NC 27710 USA
关键词
self-organizing map; cluster analysis; breast cancer; computer-aided diagnosis;
D O I
10.1016/S0933-3657(03)00003-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The purpose of this study was to identify and characterize clusters in a heterogeneous breast cancer computer-aided diagnosis database. Identification of subgroups within the database could help elucidate clinical trends and facilitate future model building. A self-organizing map (SOM) was used to identify clusters in a large (2258 cases), heterogeneous computer-aided diagnosis database based on mammographic findings (BI-RADS(TM)) and patient age. The resulting clusters were then characterized by their prototypes determined using a constraint satisfaction neural network (CSNN). The clusters showed logical separation of clinical subtypes such as architectural distortions, masses, and calcifications. Moreover, the broad categories of masses and calcifications were stratified into several clusters (seven for masses and three for calcifications). The percent of the cases that were malignant was notably different among the clusters (ranging from 6 to 83%). A feed-forward back-propagation artificial neural network (BP-ANN) was used to identify likely benign lesions that may be candidates for follow up rather than biopsy. The performance of the BP-ANN varied considerably across the clusters identified by the SOM. in particular, a cluster (#6) of mass cases (6% malignant) was identified that accounted for 79% of the recommendations for follow up that would have been made by the BP-ANN. A classification rule based on the profile of cluster #6 performed comparably to the BP-ANN, providing approximately 25% specificity at 98% sensitivity. This performance was demonstrated to generalize to a large (2177) set of cases held-out for model validation. (C) 2003 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:113 / 127
页数:15
相关论文
共 31 条
[1]  
American College of Radiology, 1998, ILL BREAST IM REP DA
[2]  
[Anonymous], 1999, Combining Artificial Neural Nets: Ensemble and Modular Multi-Net Systems
[3]   BREAST-CANCER - PREDICTION WITH ARTIFICIAL NEURAL-NETWORK-BASED ON BI-RADS STANDARDIZED LEXICON [J].
BAKER, JA ;
KORNGUTH, PJ ;
LO, JY ;
WILLIFORD, ME ;
FLOYD, CE .
RADIOLOGY, 1995, 196 (03) :817-822
[4]  
Bishop C. M., 1995, NEURAL NETWORKS PATT
[5]  
Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946
[6]   Breast cancer diagnosis using self-organizing map for sonography [J].
Chen, DR ;
Chang, RF ;
Huang, YL .
ULTRASOUND IN MEDICINE AND BIOLOGY, 2000, 26 (03) :405-411
[7]   Computer-aided diagnosis in radiology: potential and pitfalls [J].
Doi, K ;
MacMahon, H ;
Katsuragawa, S ;
Nishikawa, RM ;
Jiang, YL .
EUROPEAN JOURNAL OF RADIOLOGY, 1999, 31 (02) :97-109
[8]  
Efron B., 1993, INTRO BOOTSTRAP, V1st ed., DOI DOI 10.1201/9780429246593
[9]   Case-based reasoning computer algorithm that uses mammographic findings for breast biopsy decisions [J].
Floyd, CE ;
Lo, JY ;
Tourassi, GD .
AMERICAN JOURNAL OF ROENTGENOLOGY, 2000, 175 (05) :1347-1352
[10]   Computer-aided diagnosis of breast lesions in medical images [J].
Giger, ML .
COMPUTING IN SCIENCE & ENGINEERING, 2000, 2 (05) :39-45