Clinical Phenotypic Spectrum of 4095 Individuals with Down Syndrome from Text Mining of Electronic Health Records

被引:7
作者
Havrilla, James Margolin [1 ]
Zhao, Mengge [1 ]
Liu, Cong [2 ]
Weng, Chunhua [2 ]
Helbig, Ingo [3 ,4 ,5 ,6 ]
Bhoj, Elizabeth [7 ,8 ]
Wang, Kai [1 ,3 ,9 ]
机构
[1] Childrens Hosp Philadelphia, Ctr Cellular & Mol Therapeut, Philadelphia, PA 19104 USA
[2] Columbia Univ, Irving Med Ctr, Dept Biomed Informat, New York, NY 10032 USA
[3] Childrens Hosp Philadelphia, Dept Biomed & Hlth Informat, Philadelphia, PA 19104 USA
[4] Childrens Hosp Philadelphia, Div Neurol, Philadelphia, PA 19104 USA
[5] Childrens Hosp Philadelphia, Epilepsy NeuroGenet Initiat ENGIN, Philadelphia, PA 19104 USA
[6] Univ Penn, Perelman Sch Med, Dept Neurol, Philadelphia, PA 19104 USA
[7] Childrens Hosp Philadelphia, Div Human Genet, Philadelphia, PA 19104 USA
[8] Univ Penn, Perelman Sch Med, Dept Pediat, Philadelphia, PA 19104 USA
[9] Univ Penn, Perelman Sch Med, Dept Pathol & Lab Med, Philadelphia, PA 19104 USA
关键词
Down syndrome; phenotype; electronic health records; phenotypic spectrum; longitudinal study; natural language processing; text mining; large-scale; INFORMATION; DATABASE; SEQUENCE; CHILDREN; UMLS;
D O I
10.3390/genes12081159
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Human genetic disorders, such as Down syndrome, have a wide variety of clinical phenotypic presentations, and characterizing each nuanced phenotype and subtype can be difficult. In this study, we examined the electronic health records of 4095 individuals with Down syndrome at the Children's Hospital of Philadelphia to create a method to characterize the phenotypic spectrum digitally. We extracted Human Phenotype Ontology (HPO) terms from quality-filtered patient notes using a natural language processing (NLP) approach MetaMap. We catalogued the most common HPO terms related to Down syndrome patients and compared the terms with those from a baseline population. We characterized the top 100 HPO terms by their frequencies at different ages of clinical visits and highlighted selected terms that have time-dependent distributions. We also discovered phenotypic terms that have not been significantly associated with Down syndrome, such as "Proptosis", "Downslanted palpebral fissures", and "Microtia". In summary, our study demonstrated that the clinical phenotypic spectrum of individual with Mendelian diseases can be characterized through NLP-based digital phenotyping on population-scale electronic health records (EHRs).
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Utilizing big data from electronic health records in pediatric clinical care
    Macias, Charles G.
    Remy, Kenneth E.
    Barda, Amie J.
    [J]. PEDIATRIC RESEARCH, 2023, 93 (02) : 382 - 389
  • [42] Current approaches to identify sections within clinical narratives from electronic health records: a systematic review
    Pomares-Quimbaya, Alexandra
    Kreuzthaler, Markus
    Schulz, Stefan
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2019, 19 (1) : 155
  • [43] Electronic Health Records in Sweden: From Administrative Management to Clinical Decision Support
    Kajbjer, Karin
    Nordberg, Ragnar
    Klein, Gunnar O.
    [J]. HISTORY OF NORDIC COMPUTING 3, 2011, 350 : 74 - +
  • [44] Practical use case of natural language processing for observational clinical research data retrieval from electronic health records: AssistMED project
    Maciejewski, Cezary
    Ozieranski, Krzysztof
    Basza, Mikolaj
    Barwiolek, Adam
    Ciurla, Michalina
    Bozym, Aleksandra
    Krajsman, Maciej J.
    Lodzinski, Piotr
    Opolski, Grzegorz
    Grabowski, Marcin
    Cacko, Andrzej
    Balsam, Pawel
    [J]. POLISH ARCHIVES OF INTERNAL MEDICINE-POLSKIE ARCHIWUM MEDYCYNY WEWNETRZNEJ, 2024, 134 (05):
  • [45] Generating and Reporting Electronic Clinical Quality Measures from Electronic Health Records: Strategies from EvidenceNOW Cooperatives
    Richardson, Joshua E.
    Rasmussen, Luke, V
    Dorr, David A.
    Sirkin, Jenna T.
    Shelley, Donna
    Rivera, Adovich
    Wu, Winfred
    Cykert, Samuel
    Cohen, Deborah J.
    Kho, Abel N.
    [J]. APPLIED CLINICAL INFORMATICS, 2022, 13 (02): : 485 - 494
  • [46] Health-Related Quality of Life in Individuals with Down Syndrome: Results from a Non-Interventional Longitudinal Multi-National Study
    Rofail, Diana
    Froggatt, Daniel
    de la Torre, Rafael
    Edgin, Jamie
    Kishnani, Priya
    Touraine, Renaud
    Whitwham, Sarah
    Squassante, Lisa
    Khwaja, Omar
    D'Ardhuy, Xavier Liogier
    [J]. ADVANCES IN THERAPY, 2017, 34 (08) : 2058 - 2069
  • [47] Additional Value From Free-Text Diagnoses in Electronic Health Records: Hybrid Dictionary and Machine Learning Classification Study
    Mehra, Tarun
    Wekhof, Tobias
    Keller, Dagmar Iris
    [J]. JMIR MEDICAL INFORMATICS, 2024, 12
  • [48] Harnessing the power of electronic health records and open natural language data mining to capture meaningful patient experience during routine clinical care
    Larrow, Danielle R.
    Kadosh, Orna Katz
    Fracchia, Shannon
    Radano, Marcella
    Hartnick, Christopher J.
    [J]. INTERNATIONAL JOURNAL OF PEDIATRIC OTORHINOLARYNGOLOGY, 2023, 173
  • [49] Predicting Clinical Diagnosis from Patients Electronic Health Records Using BERT-Based Neural Networks
    Blinov, Pavel
    Avetisian, Manvel
    Kokh, Vladimir
    Umerenkov, Dmitry
    Tuzhilin, Alexander
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE (AIME 2020), 2020, : 111 - 121
  • [50] Combining billing codes, clinical notes, and medications from electronic health records provides superior phenotyping performance
    Wei, Wei-Qi
    Teixeira, Pedro L.
    Mo, Huan
    Cronin, Robert M.
    Warner, Jeremy L.
    Denny, Joshua C.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2016, 23 (E1) : E20 - E27