Early Detection of Lung Cancer Risk Using Data Mining

被引:41
作者
Ahmed, Kawsar [1 ]
Abdullah-Al-Emran [2 ]
Jesmin, Tasnuba [1 ]
Mukti, Roushney Fatima [2 ]
Rahman, Md Zamilur [1 ]
Ahmed, Farzana [3 ]
机构
[1] Mawlana Bhashani Sci & Technol Univ, Dept Informat & Commun Technol, Tangail, Bangladesh
[2] Mawlana Bhashani Sci & Technol Univ, Dept Biotechnol & Genet Engn, Tangail, Bangladesh
[3] BRAC Univ, Dept Math & Nat Sci, Dhaka, Bangladesh
关键词
Data mining; pre-processing; disease diagnosis; aprioriTid algorithm; DT algorithm; Bangladesh;
D O I
10.7314/APJCP.2013.14.1.595
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background: Lung cancer is the leading cause of cancer death worldwide Therefore, identification of genetic as well as environmental factors is very important in developing novel methods of lung cancer prevention. However, this is a multi-layered problem. Therefore a lung cancer risk prediction system is here proposed which is easy, cost effective and time saving. Materials and Methods: Initially 400 cancer and non-cancer patients' data were collected from different diagnostic centres, pre-processed and clustered using a K-means clustering algorithm for identifying relevant and non-relevant data. Next significant frequent patterns are discovered using AprioriTid and a decision tree algorithm. Results: Finally using the significant pattern prediction tools for a lung cancer prediction system were developed. This lung cancer risk prediction system should prove helpful in detection of a person's predisposition for lung cancer. Conclusions: Most of people of Bangladesh do not even know they have lung cancer and the majority of cases are diagnosed at late stages when cure is impossible. Therefore early prediction of lung cancer should play a pivotal role in the diagnosis process and for an effective preventive strategy.
引用
收藏
页码:595 / 598
页数:4
相关论文
共 11 条
[1]  
Ben-Haim Y, 2010, J MACH LEARN RES, V11, P849
[2]   Genetics of lung-cancer susceptibility [J].
Brennan, Paul ;
Hainaut, Pierre ;
Boffetta, Paolo .
LANCET ONCOLOGY, 2011, 12 (04) :399-408
[3]   Minkowski metric, feature weighting and anomalous cluster initializing in K-Means clustering [J].
de Amorim, Renato Cordeiro ;
Mirkin, Boris .
PATTERN RECOGNITION, 2012, 45 (03) :1061-1075
[4]  
Ferlay J, 2010, IARC, V10, P220
[5]  
Gothwal H., 2011, Journal of Biomedical Science and Engineering, V4, P289, DOI DOI 10.4236/JBISE.2011.44039
[6]   A Novel Classification Method for Diagnosis of Diabetes Mellitus Using Artificial Neural Networks [J].
Jayalakshmi, T. ;
Santhakumaran, A. .
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA STORAGE AND DATA ENGINEERING (DSDE 2010), 2010, :159-163
[7]  
Lan C, 2010, COMPUTER APPL SOFTWA, V27, P234
[8]  
Pradhan M., 2011, Int. J. Comput. Sci. Emerg. Technol, V2, P303
[9]  
Sapon MA, 2011, INT PROC COMPUT SCI, V7, P299
[10]   Radon in Indoor Spaces An Underestimated Risk Factor for Lung Cancer in Environmental Medicine [J].
Schmid, Klaus ;
Kuwert, Torsten ;
Drexler, Hans .
DEUTSCHES ARZTEBLATT INTERNATIONAL, 2010, 107 (11) :181-U9