High Throughput Phenotyping for Dimensional Psychopathology in Electronic Health Records

被引:46
作者
McCoy, Thomas H., Jr. [1 ,2 ]
Yu, Sheng [3 ,4 ,5 ]
Hart, Kamber L. [1 ,2 ]
Castro, Victor M. [1 ,2 ]
Brown, Hannah E. [1 ,2 ]
Rosenquist, James N. [1 ,2 ]
Doyle, Alysa E. [1 ,2 ]
Vuijk, Pieter J. [1 ,2 ]
Cai, Tianxi [3 ,4 ]
Perlis, Roy H. [1 ,2 ]
机构
[1] Massachusetts Gen Hosp, Ctr Quantitat Hlth, Boston, MA 02114 USA
[2] Massachusetts Gen Hosp, Dept Psychiat, Boston, MA 02114 USA
[3] Harvard Med Sch, Boston, MA USA
[4] Harvard Sch Publ Hlth, Boston, MA USA
[5] Tsinghua Univ, Beijing, Peoples R China
关键词
Computed phenotype; Electronic health record; Natural language processing; Research Domain Criteria; Topic modeling; Transdiagnostic; RESEARCH DOMAIN CRITERIA; LARGE-SCALE; CLASSIFICATION; ENTERPRISE; DISORDERS;
D O I
10.1016/j.biopsych.2018.01.011
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
BACKGROUND: Relying on diagnostic categories of neuropsychiatric illness obscures the complexity of these disorders. Capturing multiple dimensional measures of neuropathology could facilitate the clinical and neurobiological investigation of cognitive and behavioral phenotypes. METHODS: We developed a natural language processing-based approach to extract five symptom dimensions, based on the National Institute of Mental Health Research Domain Criteria definitions, from narrative clinical notes. Estimates of Research Domain Criteria loading were derived from a cohort of 3619 individuals with 4623 hospital admissions. We applied this tool to a large corpus of psychiatric inpatient admission and discharge notes (2010-2015), and using the same cohort we examined face validity, predictive validity, and convergent validity with gold standard annotations. RESULTS: In mixed-effect models adjusted for sociodemographic and clinical features, greater negative and positive symptom domains were associated with a shorter length of stay (beta = -.88, p = .001 and beta = -1.22, p < .001, respectively), while greater social and arousal domain scores were associated with a longer length of stay (beta = .93, p < .001 and beta = .81, p = .007, respectively). In fully adjusted Cox regression models, a greater positive domain score at discharge was also associated with a significant increase in readmission risk (hazard ratio = 1.22, p < .001). Positive and negative valence domains were correlated with expert annotation (by analysis of variance [df = 3], R-2 = .13 and .19, respectively). Likewise, in a subset of patients, neurocognitive testing was correlated with cognitive performance scores (p < .008 for three of six measures). CONCLUSIONS: This shows that natural language processing can be used to efficiently and transparently score clinical notes in terms of cognitive and psychopathologic domains.
引用
收藏
页码:997 / 1004
页数:8
相关论文
共 43 条
[1]  
[Anonymous], UMLS REFERENCE MANUA
[2]   Bootstrap methods for developing predictive models [J].
Austin, PC ;
Tu, JV .
AMERICAN STATISTICIAN, 2004, 58 (02) :131-137
[3]   An atlas of genetic correlations across human diseases and traits [J].
Bulik-Sullivan, Brendan ;
Finucane, Hilary K. ;
Anttila, Verneri ;
Gusev, Alexander ;
Day, Felix R. ;
Loh, Po-Ru ;
Duncan, Laramie ;
Perry, John R. B. ;
Patterson, Nick ;
Robinson, Elise B. ;
Daly, Mark J. ;
Price, Alkes L. ;
Neale, Benjamin M. .
NATURE GENETICS, 2015, 47 (11) :1236-+
[4]   Large-Scale Multiple Testing of Correlations [J].
Cai, T. Tony ;
Liu, Weidong .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2016, 111 (513) :229-240
[5]  
Chapman WW, 2001, J AM MED INFORM ASSN, P105
[6]  
DEERWESTER S, 1990, J AM SOC INFORM SCI, V41, P391, DOI 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO
[7]  
2-9
[8]   Automated Neuropsychological Test Battery (CANTAB) in mild cognitive impairment and in Alzheimer's disease [J].
Egerhazi, Aniko ;
Berecz, Roland ;
Bartok, Eniko ;
Degrell, Istvan .
PROGRESS IN NEURO-PSYCHOPHARMACOLOGY & BIOLOGICAL PSYCHIATRY, 2007, 31 (03) :746-751
[9]   Using the Cambridge Neuropsychological Test Automated Battery (CANTAB) to assess the cognitive impact of electroconvulsive therapy on visual and visuospatial memory [J].
Falconer, D. W. ;
Cleland, J. ;
Fielding, S. ;
Reid, I. C. .
PSYCHOLOGICAL MEDICINE, 2010, 40 (06) :1017-1025
[10]  
Filannino M, 2016, 2016 CEGS N GRID SHA