共 50 条
Predictability Bounds of Electronic Health Records
被引:17
|作者:
Dahlem, Dominik
[1
,2
]
Maniloff, Diego
[2
]
Ratti, Carlo
[2
]
机构:
[1] IBM Res Ireland, Dublin 15, Ireland
[2] MIT, Senseable City Lab, Cambridge, MA 02139 USA
来源:
SCIENTIFIC REPORTS
|
2015年
/
5卷
基金:
美国国家科学基金会;
关键词:
PREDICTION;
TIME;
CARE;
INFORMATION;
ENTROPY;
BIAS;
D O I:
10.1038/srep11865
中图分类号:
O [数理科学和化学];
P [天文学、地球科学];
Q [生物科学];
N [自然科学总论];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
The ability to intervene in disease progression given a person's disease history has the potential to solve one of society's most pressing issues: advancing health care delivery and reducing its cost. Controlling disease progression is inherently associated with the ability to predict possible future diseases given a patient's medical history. We invoke an information-theoretic methodology to quantify the level of predictability inherent in disease histories of a large electronic health records dataset with over half a million patients. In our analysis, we progress from zeroth order through temporal informed statistics, both from an individual patient's standpoint and also considering the collective effects. Our findings confirm our intuition that knowledge of common disease progressions results in higher predictability bounds than treating disease histories independently. We complement this result by showing the point at which the temporal dependence structure vanishes with increasing orders of the time-correlated statistic. Surprisingly, we also show that shuffling individual disease histories only marginally degrades the predictability bounds. This apparent contradiction with respect to the importance of time-ordered information is indicative of the complexities involved in capturing the health-care process and the difficulties associated with utilising this information in universal prediction algorithms.
引用
收藏
页数:9
相关论文