Clinical Phenotypic Spectrum of 4095 Individuals with Down Syndrome from Text Mining of Electronic Health Records

被引:7
作者
Havrilla, James Margolin [1 ]
Zhao, Mengge [1 ]
Liu, Cong [2 ]
Weng, Chunhua [2 ]
Helbig, Ingo [3 ,4 ,5 ,6 ]
Bhoj, Elizabeth [7 ,8 ]
Wang, Kai [1 ,3 ,9 ]
机构
[1] Childrens Hosp Philadelphia, Ctr Cellular & Mol Therapeut, Philadelphia, PA 19104 USA
[2] Columbia Univ, Irving Med Ctr, Dept Biomed Informat, New York, NY 10032 USA
[3] Childrens Hosp Philadelphia, Dept Biomed & Hlth Informat, Philadelphia, PA 19104 USA
[4] Childrens Hosp Philadelphia, Div Neurol, Philadelphia, PA 19104 USA
[5] Childrens Hosp Philadelphia, Epilepsy NeuroGenet Initiat ENGIN, Philadelphia, PA 19104 USA
[6] Univ Penn, Perelman Sch Med, Dept Neurol, Philadelphia, PA 19104 USA
[7] Childrens Hosp Philadelphia, Div Human Genet, Philadelphia, PA 19104 USA
[8] Univ Penn, Perelman Sch Med, Dept Pediat, Philadelphia, PA 19104 USA
[9] Univ Penn, Perelman Sch Med, Dept Pathol & Lab Med, Philadelphia, PA 19104 USA
关键词
Down syndrome; phenotype; electronic health records; phenotypic spectrum; longitudinal study; natural language processing; text mining; large-scale; INFORMATION; DATABASE; SEQUENCE; CHILDREN; UMLS;
D O I
10.3390/genes12081159
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Human genetic disorders, such as Down syndrome, have a wide variety of clinical phenotypic presentations, and characterizing each nuanced phenotype and subtype can be difficult. In this study, we examined the electronic health records of 4095 individuals with Down syndrome at the Children's Hospital of Philadelphia to create a method to characterize the phenotypic spectrum digitally. We extracted Human Phenotype Ontology (HPO) terms from quality-filtered patient notes using a natural language processing (NLP) approach MetaMap. We catalogued the most common HPO terms related to Down syndrome patients and compared the terms with those from a baseline population. We characterized the top 100 HPO terms by their frequencies at different ages of clinical visits and highlighted selected terms that have time-dependent distributions. We also discovered phenotypic terms that have not been significantly associated with Down syndrome, such as "Proptosis", "Downslanted palpebral fissures", and "Microtia". In summary, our study demonstrated that the clinical phenotypic spectrum of individual with Mendelian diseases can be characterized through NLP-based digital phenotyping on population-scale electronic health records (EHRs).
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Data mining information from electronic health records produced high yield and accuracy for current smoking status
    Groenhof, T. Katrien J.
    Koers, Laurien R.
    Blasse, Enja
    de Groot, Mark
    Grobbee, Diederick E.
    Bots, Michiel L.
    Asselbergs, Folkert W.
    Lely, A. Titia
    Haitjema, Saskia
    van Solinge, Wouter
    Hoefer, Imo
    Haitjema, Saskia
    de Groot, Mark
    Asselbergs, F. W.
    de Borst, G. J.
    Bots, M. L.
    Dieleman, S.
    Emmelot, M. H.
    de Jong, P. A.
    Lely, A. T.
    Hoefer, I. E.
    van der Kaaij, N. P.
    Ruigrok, Y. M.
    Verhaar, M. C.
    Visseren, F. L. J.
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2020, 118 : 100 - 106
  • [32] Clinical Roadmap for Implementing Results from Electronic Health Records Queries
    FitzPatrick, Amy
    Cunningham-Rundles, Charlotte
    Sacco, Keith
    Chin, Aaron
    Butte, Manish
    Hartog, Nicholas
    Izadi, Neema
    Relan, Anurag
    Rider, Nicholas
    CLINICAL IMMUNOLOGY, 2024, 262
  • [33] Estimating medication adherence from Electronic Health Records: comparing methods for mining and processing asthma treatment prescriptions
    Tibble, Holly
    Sheikh, Aziz
    Tsanas, Athanasios
    BMC MEDICAL RESEARCH METHODOLOGY, 2023, 23 (01)
  • [34] Domain over size: Clinical ELECTRA surpasses general BERT for bleeding site classification in the free text of electronic health records
    Pedersen, Jannik S.
    Laursen, Martin S.
    Soguero-Ruiz, Cristina
    Savarimuthu, Thiusius R.
    Hansen, Rasmus Sogaard
    Vinholt, Pernille J.
    2022 IEEE-EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS (BHI) JOINTLY ORGANISED WITH THE IEEE-EMBS INTERNATIONAL CONFERENCE ON WEARABLE AND IMPLANTABLE BODY SENSOR NETWORKS (BSN'22), 2022,
  • [35] Identifying vulnerable older adult populations by contextualizing geriatric syndrome information in clinical notes of electronic health records
    Chen, Tao
    Dredze, Mark
    Weiner, Jonathan R.
    Kharrazi, Hadi
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2019, 26 (8-9) : 787 - 795
  • [36] Binary acronym disambiguation in clinical notes from electronic health records with an application in computational phenotyping
    Link, Nicholas B.
    Huang, Sicong
    Cai, Tianrun
    Sun, Jiehuan
    Dahal, Kumar
    Costa, Lauren
    Cho, Kelly
    Liao, Katherine
    Cai, Tianxi
    Hong, Chuan
    Collaboration Million Vet Program, Million Veteran Program
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2022, 162
  • [37] Predicting near-term glaucoma progression: An artificial intelligence approach using clinical free-text notes and data from electronic health records
    Jalamangala Shivananjaiah, Sunil K.
    Kumari, Sneha
    Majid, Iyad
    Wang, Sophia Y.
    FRONTIERS IN MEDICINE, 2023, 10
  • [38] Automated Extraction of Diagnostic Criteria From Electronic Health Records for Autism Spectrum Disorders: Development, Evaluation, and Application
    Leroy, Gondy
    Gu, Yang
    Pettygrove, Sydney
    Galindo, Maureen K.
    Arora, Ananyaa
    Kurzius-Spencer, Margaret
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2018, 20 (11)
  • [39] Adverse Event extraction from Structured Product Labels using the Event-based Text-mining of Health Electronic Records (ETHER) system
    Pandey, Abhishek
    Kreimeyer, Kory
    Foster, Matthew
    Oanh Dang
    Ly, Thomas
    Wang, Wei
    Forshee, Richard
    Botsis, Taxiarchis
    HEALTH INFORMATICS JOURNAL, 2019, 25 (04) : 1232 - 1243
  • [40] Diabetes and Obesity in Down Syndrome Across the Lifespan: A Retrospective Cohort Study Using UK Electronic Health Records
    Aslam, Aisha A.
    Baksh, R. Asaad
    Pape, Sarah E.
    Strydom, Andre
    Gulliford, Martin C.
    Chan, Li F.
    DIABETES CARE, 2022, 45 (12) : 2892 - 2899