Discovery of Hidden Patterns in Breast Cancer Patients, Using Data Mining on a Real Data Set

被引:1
作者
Atashi, Alireza [1 ]
Tohidinezhad, Fariba [2 ]
Dorri, Sara [3 ]
Nazeri, Najmeh [3 ]
Ghousi, Rouzbeh [4 ]
Marashi, Sina [1 ]
Hajialiasgari, Fatemeh [1 ]
机构
[1] Univ Tehran Med Sci, Virtual Sch, E Hlth Dept, Tehran, Iran
[2] Mashhad Univ Med Sci, Dept Med Informat, Fac Med, Mashhad, Razavi Khorasan, Iran
[3] ACECR, Dept Clin Res, Breast Canc Res Ctr, Motamed Canc Inst, Tehran, Iran
[4] Iran Univ Sci & Technol, Sch Ind Engn, Tehran, Iran
来源
HEALTH INFORMATICS VISION: FROM DATA VIA INFORMATION TO KNOWLEDGE | 2019年 / 262卷
关键词
Breast Cancer; Data Mining; Association Rule Mining; Pattern Recognition; Knowledge Discovery; Iran;
D O I
10.3233/SHTI190037
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The aim is to recognize the unknown atterns in a real breast cancer dataset using data mining algorithms as a new method in medicine. Due to excessive missing data in the collection only data on 665 of 809 patients were available. The other missing values were estimated using the EM algorithm in SPSS21 software. Fields have been converted into discrete fields and finally the APRIORI algorithm has been used to analyze and explore the unknown patterns. After the rule extraction, experts in the field of breast cancer eliminated redundant and meaningless relations. 100 association rules with a confidence value of more than 0.9 explored by the APRIORI algorithm and after the clinical expert feedback, 10 clinically meaningful relations have been detected and reported. Due to the high number of risk factors, the use of data mining is effective for cancer data. These patterns provide the future study hypotheses of specific clinical studies.
引用
收藏
页码:142 / 145
页数:4
相关论文
共 9 条
[1]  
Aalaei S, 2016, IRAN J BASIC MED SCI, V19, P476
[2]  
Ahmad P., 2015, INT J COMPUTER APPL, V120
[3]   Trends in net survival from 15 cancers in six European Latin countries: the SUDCAN population-based study material [J].
Crocetti, Emanuele ;
Bossard, Nadine ;
Uhry, Zoe ;
Roche, Laurent ;
Rossi, Silvia ;
Capocaccia, Riccardo ;
Faivre, Jean .
EUROPEAN JOURNAL OF CANCER PREVENTION, 2017, 26 :S3-S8
[4]  
Delen D., 2010, ARTIFICIAL INTELLIGE, V34, P113
[5]   Applying Data Mining Techniques to Improve Breast Cancer Diagnosis [J].
Diz, Joana ;
Marreiros, Goreti ;
Freitas, Alberto .
JOURNAL OF MEDICAL SYSTEMS, 2016, 40 (09)
[6]  
Fitzmaurice C, 2017, JAMA ONCOL, V3, P524, DOI [10.1001/jamaoncol.2016.5688, 10.1001/jamaoncol.2018.2706]
[7]  
Kiani B., 2014, J HLTH BIOMEDICAL IN, V1, P26
[8]   Technology of building an expert system based on a set of quantitative features of tumor cell nuclei for diagnosing breast cancer [J].
Kirillov, Vladimir .
DIAGNOSTIC CYTOPATHOLOGY, 2013, 41 (06) :475-484
[9]  
Tolouei-Sh A., 2012, IRANIAN J BREAST DIS, V5, P23