Characterizing Mammography Reports for Health Analytics

被引:0
作者
Carlos C. Rojas
Robert M. Patton
Barbara G. Beckerman
机构
[1] Oak Ridge National Lab,
来源
Journal of Medical Systems | 2011年 / 35卷
关键词
Text analysis; Clinical notes; Temporal analysis; Mammography reports;
D O I
暂无
中图分类号
学科分类号
摘要
As massive collections of digital health data are becoming available, the opportunities for large-scale automated analysis increase. In particular, the widespread collection of detailed health information is expected to help realize a vision of evidence-based public health and patient-centric health care. Within such a framework for large scale health analytics we describe the transformation of a large data set of mostly unlabeled and free-text mammography data into a searchable and accessible collection, usable for analytics. We also describe several methods to characterize and analyze the data, including their temporal aspects, using information retrieval, supervised learning, and classical statistical techniques. We present experimental results that demonstrate the validity and usefulness of the approach, since the results are consistent with the known features of the data, provide novel insights about it, and can be used in specific applications. Additionally, based on the process of going from raw data to results from analysis, we present the architecture of a generic system for health analytics from clinical notes.
引用
收藏
页码:1197 / 1210
页数:13
相关论文
共 43 条
[1]  
Aronow DB(1999)Ad hoc classification of radiology reports J. Am. Med. Inform. Assoc. 6 393-411
[2]  
Fangfang F(2008)IBM’s vision for the future in patient-centric global health care: IBM’s vision of how advanced health analytics and automated health information infrastructure will transform anatomic pathology services Arch. Pathol. Lab. Med. 132 766-771
[3]  
Croft WB(2000)Computer-aided diagnosis of breast lesions in medical images Comput. Sci. Eng. 2 39-45
[4]  
Bakalar R(2004)Syndromic surveillance in public health practice, New York City Emerg. Infect. Dis. 10 858-864
[5]  
Giger M(2010)Temporal pattern discovery in longitudinal electronic patient records Data Mining and Knowledge Discovery 20 1-27
[6]  
Heffernan R(1980)An algorithm for suffix stripping Program: Electronic Library and Information Systems 14 130-137
[7]  
Mostashari F(2007)Importance of comparison of current and prior mammograms in breast cancer screening Radiology 242 70-538
[8]  
Das D(2008)Negation recognition in medical narrative reports Inf. Retr. 11 499-49
[9]  
Karpati A(1978)Dynamic programming algorithm optimization for spoken word recognition IEEE Transactions on Acoustics, Speech and Signal Processing 26 43-523
[10]  
Kulldorff M(1988)Term-weighting approaches in automatic text retrieval Inf. Process. Manag. 24 513-47