Incorporating repeating temporal association rules in Naive Bayes classifiers for coronary heart disease diagnosis

被引:24
作者
Orphanou, Kalia [1 ]
Dagliati, Arianna [3 ,6 ,7 ]
Sacchi, Lucia [2 ,3 ]
Stassopoulou, Athena [5 ]
Keravnou, Elpida [1 ]
Bellazzi, Riccardo [2 ,3 ,4 ]
机构
[1] Univ Cyprus, Dept Comp Sci, Nicosia, Cyprus
[2] Univ Pavia, Dept Elect Comp & Biomed Engn, Pavia, Italy
[3] Univ Pavia, Ctr Hlth Technol, Pavia, Italy
[4] IRCCS Ist Clin Sci Maugeri, Pavia, Italy
[5] Univ Nicosia, Dept Comp Sci, Nicosia, Cyprus
[6] Univ Manchester, Ctr Hlth Informat, Manchester, Lancs, England
[7] Univ Manchester, Manchester Mol Pathol Innovat Ctr, Manchester, Lancs, England
关键词
Bayesian models; Time series classification; Temporal abstraction; Temporal reasoning; Temporal association rules; TIME-SERIES; MINING APPROACH; ABSTRACTION; PATTERNS; PREDICTION; FRAMINGHAM; FRAMEWORK; RECORDS; RISK;
D O I
10.1016/j.jbi.2018.03.002
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we develop a Naive Bayes classification model integrated with temporal association rules (TARs). A temporal pattern mining algorithm is used to detect TARs by identifying the most frequent temporal relationships among the derived basic temporal abstractions (TA). We develop and compare three classifiers that use as features the most frequent TARS as follows: (i) representing the most frequent TARs detected within the target class ('Disease = Present'), (ii) representing the most frequent TARs from both classes ('Disease = Present', 'Disease = Absent'), (iii) representing the most frequent TARS, after removing the ones that are low-risk predictors for the disease. These classifiers incorporate the horizontal support of TARs, which defines the number of times that a particular temporal pattern is found in some patient's record, as their features. All of the developed classifiers are applied for diagnosis of coronary heart disease (CHD) using a longitudinal dataset. We compare two ways of feature representation, using horizontal support or the mean duration of each TAR, on a single patient. The results obtained from this comparison show that the horizontal support representation outperforms the mean duration. The main effort of our research is to demonstrate that where long time periods are of significance in some medical domain, such as the CHD domain, the detection of the repeated occurrences of the most frequent TARS can yield better performances. We compared the classifier that uses the horizontal support representation and has the best performance with a Baseline Classifier which uses the binary representation of the most frequent TARs. The results obtained illustrate the comparatively high performance of the classifier representing the horizontal support, over the Baseline Classifier.
引用
收藏
页码:74 / 82
页数:9
相关论文
共 68 条
[1]   Dynamical Phenotyping: Using Temporal Analysis of Clinically Collected Physiologic Data to Stratify Populations [J].
Albers, D. J. ;
Elhadad, Noemie ;
Tabak, E. ;
Perotte, A. ;
Hripcsak, George .
PLOS ONE, 2014, 9 (06)
[2]  
[Anonymous], 2006, Introduction to Data Mining
[3]  
[Anonymous], 1973, Pattern Classification and Scene Analysis
[4]  
[Anonymous], 2002, Dynamic bayesian networks: representation, inference and learning
[5]  
Batal I., 2009, AMIA ANN S P, V2009
[6]   An efficient pattern mining approach for event detection in multivariate temporal data [J].
Batal, Iyad ;
Cooper, Gregory F. ;
Fradkin, Dmitriy ;
Harrison, James, Jr. ;
Moerchen, Fabian ;
Hauskrecht, Milos .
KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 46 (01) :115-150
[7]   A Temporal Pattern Mining Approach for Classifying Electronic Health Record Data [J].
Batal, Iyad ;
Valizadegan, Hamed ;
Cooper, Gregory F. ;
Hauskrecht, Milos .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2013, 4 (04)
[8]  
Batista GEAPA, 2005, LECT NOTES COMPUT SC, V3646, P24
[9]   Temporal data mining for the quality assessment of hemodialysis services [J].
Belazzi, R ;
Larizza, C ;
Magni, P ;
Bellazzi, R .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2005, 34 (01) :25-39
[10]  
Bishop C.M., 2006, PATTERN RECOGN, V4, P738, DOI DOI 10.1117/1.2819119