Clinical Phenotypic Spectrum of 4095 Individuals with Down Syndrome from Text Mining of Electronic Health Records

Cited by: 7
Authors
Havrilla, James Margolin [1]
Zhao, Mengge [1]
Liu, Cong [2]
Weng, Chunhua [2]
Helbig, Ingo [3, 4, 5, 6]
Bhoj, Elizabeth [7, 8]
Wang, Kai [1, 3, 9]
Affiliations
[1] Childrens Hosp Philadelphia, Ctr Cellular & Mol Therapeut, Philadelphia, PA 19104 USA
[2] Columbia Univ, Irving Med Ctr, Dept Biomed Informat, New York, NY 10032 USA
[3] Childrens Hosp Philadelphia, Dept Biomed & Hlth Informat, Philadelphia, PA 19104 USA
[4] Childrens Hosp Philadelphia, Div Neurol, Philadelphia, PA 19104 USA
[5] Childrens Hosp Philadelphia, Epilepsy NeuroGenet Initiat ENGIN, Philadelphia, PA 19104 USA
[6] Univ Penn, Perelman Sch Med, Dept Neurol, Philadelphia, PA 19104 USA
[7] Childrens Hosp Philadelphia, Div Human Genet, Philadelphia, PA 19104 USA
[8] Univ Penn, Perelman Sch Med, Dept Pediat, Philadelphia, PA 19104 USA
[9] Univ Penn, Perelman Sch Med, Dept Pathol & Lab Med, Philadelphia, PA 19104 USA
Keywords
Down syndrome; phenotype; electronic health records; phenotypic spectrum; longitudinal study; natural language processing; text mining; large-scale; INFORMATION; DATABASE; SEQUENCE; CHILDREN; UMLS
DOI
10.3390/genes12081159
Chinese Library Classification number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Human genetic disorders, such as Down syndrome, have a wide variety of clinical phenotypic presentations, and characterizing each nuanced phenotype and subtype can be difficult. In this study, we examined the electronic health records of 4095 individuals with Down syndrome at the Children's Hospital of Philadelphia to create a method for characterizing the phenotypic spectrum digitally. We extracted Human Phenotype Ontology (HPO) terms from quality-filtered patient notes using MetaMap, a natural language processing (NLP) tool. We catalogued the most common HPO terms in Down syndrome patients and compared them with terms from a baseline population. We characterized the top 100 HPO terms by their frequencies at different ages of clinical visits and highlighted selected terms that have time-dependent distributions. We also discovered phenotypic terms that have not been significantly associated with Down syndrome, such as "Proptosis", "Downslanted palpebral fissures", and "Microtia". In summary, our study demonstrated that the clinical phenotypic spectrum of individuals with Mendelian diseases can be characterized through NLP-based digital phenotyping on population-scale electronic health records (EHRs).
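The comparison of cohort term frequencies against a baseline population can be illustrated with a small sketch. The abstract does not state which statistic the authors used, so the frequency ratio below is only an assumption chosen for simplicity. The function names (`term_frequencies`, `frequency_ratios`), the `floor` parameter, and the toy patient data are all hypothetical, and the sketch assumes HPO terms have already been extracted per patient (for example by MetaMap).

```python
from collections import Counter


def term_frequencies(patient_terms):
    """Return the fraction of patients annotated with each term at least once."""
    counts = Counter()
    for terms in patient_terms.values():
        counts.update(set(terms))  # count each term once per patient
    total = len(patient_terms)
    return {term: n / total for term, n in counts.items()}


def frequency_ratios(cohort, baseline, floor=1e-3):
    """Ratio of cohort frequency to baseline frequency for each cohort term.

    `floor` is an assumed lower bound that avoids division by zero when a
    term never appears in the baseline; it is not taken from the paper.
    """
    cohort_freq = term_frequencies(cohort)
    baseline_freq = term_frequencies(baseline)
    return {
        term: freq / max(baseline_freq.get(term, 0.0), floor)
        for term, freq in cohort_freq.items()
    }


# Toy, made-up data: patient ID -> list of extracted phenotype terms.
cohort = {
    "p1": ["Hypotonia", "Proptosis", "Hypotonia"],
    "p2": ["Hypotonia"],
    "p3": ["Fever"],
    "p4": ["Hypotonia", "Fever"],
}
baseline = {
    "b1": ["Fever"],
    "b2": ["Fever", "Hypotonia"],
    "b3": [],
    "b4": ["Fever"],
}

ratios = frequency_ratios(cohort, baseline)
# Terms ordered from most to least over-represented in the cohort.
ranked = sorted(ratios, key=ratios.get, reverse=True)
```

Counting each term once per patient keeps a heavily documented patient from dominating the frequencies. A real analysis would likely add a significance test and a correction for multiple comparisons, which this sketch omits.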
Pages: 11