Clinical Phenotypic Spectrum of 4095 Individuals with Down Syndrome from Text Mining of Electronic Health Records

被引:7
|
作者
Havrilla, James Margolin [1 ]
Zhao, Mengge [1 ]
Liu, Cong [2 ]
Weng, Chunhua [2 ]
Helbig, Ingo [3 ,4 ,5 ,6 ]
Bhoj, Elizabeth [7 ,8 ]
Wang, Kai [1 ,3 ,9 ]
机构
[1] Childrens Hosp Philadelphia, Ctr Cellular & Mol Therapeut, Philadelphia, PA 19104 USA
[2] Columbia Univ, Irving Med Ctr, Dept Biomed Informat, New York, NY 10032 USA
[3] Childrens Hosp Philadelphia, Dept Biomed & Hlth Informat, Philadelphia, PA 19104 USA
[4] Childrens Hosp Philadelphia, Div Neurol, Philadelphia, PA 19104 USA
[5] Childrens Hosp Philadelphia, Epilepsy NeuroGenet Initiat ENGIN, Philadelphia, PA 19104 USA
[6] Univ Penn, Perelman Sch Med, Dept Neurol, Philadelphia, PA 19104 USA
[7] Childrens Hosp Philadelphia, Div Human Genet, Philadelphia, PA 19104 USA
[8] Univ Penn, Perelman Sch Med, Dept Pediat, Philadelphia, PA 19104 USA
[9] Univ Penn, Perelman Sch Med, Dept Pathol & Lab Med, Philadelphia, PA 19104 USA
关键词
Down syndrome; phenotype; electronic health records; phenotypic spectrum; longitudinal study; natural language processing; text mining; large-scale; INFORMATION; DATABASE; SEQUENCE; CHILDREN; UMLS;
D O I
10.3390/genes12081159
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Human genetic disorders, such as Down syndrome, have a wide variety of clinical phenotypic presentations, and characterizing each nuanced phenotype and subtype can be difficult. In this study, we examined the electronic health records of 4095 individuals with Down syndrome at the Children's Hospital of Philadelphia to create a method to characterize the phenotypic spectrum digitally. We extracted Human Phenotype Ontology (HPO) terms from quality-filtered patient notes using a natural language processing (NLP) approach MetaMap. We catalogued the most common HPO terms related to Down syndrome patients and compared the terms with those from a baseline population. We characterized the top 100 HPO terms by their frequencies at different ages of clinical visits and highlighted selected terms that have time-dependent distributions. We also discovered phenotypic terms that have not been significantly associated with Down syndrome, such as "Proptosis", "Downslanted palpebral fissures", and "Microtia". In summary, our study demonstrated that the clinical phenotypic spectrum of individual with Mendelian diseases can be characterized through NLP-based digital phenotyping on population-scale electronic health records (EHRs).
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Text Mining Electronic Health Records to Identify Hospital Adverse Events
    Gerdes, Lars Ulrik
    Hardahl, Christian
    MEDINFO 2013: PROCEEDINGS OF THE 14TH WORLD CONGRESS ON MEDICAL AND HEALTH INFORMATICS, PTS 1 AND 2, 2013, 192 : 1145 - 1145
  • [2] Development of a text mining algorithm for identifying adverse drug reactions in electronic health records
    van de Burgt, Britt W. M.
    Wasylewicz, Arthur T. M.
    Dullemond, Bjorn
    Jessurun, Naomi T.
    Grouls, Rene J. E.
    Bouwman, R. Arthur
    Korsten, Erik H. M.
    Egberts, Toine C. G.
    JAMIA OPEN, 2024, 7 (03)
  • [3] Empirical advances with text mining of electronic health records
    T. Delespierre
    P. Denormandie
    A. Bar-Hen
    L. Josseran
    BMC Medical Informatics and Decision Making, 17
  • [4] Empirical advances with text mining of electronic health records
    Delespierre, T.
    Denormandie, P.
    Bar-Hen, A.
    Josseran, L.
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2017, 17
  • [5] Exploring the Association of Cancer and Depression in Electronic Health Records: Combining Encoded Diagnosis and Mining Free-Text Clinical Notes
    Leis, Angela
    Casadevall, David
    Albanell, Joan
    Posso, Margarita
    Macia, Francesc
    Castells, Xavier
    Ramirez-Anguita, Juan Manuel
    Roldan, Jordi Martinez
    Furlong, Laura, I
    Sanz, Ferran
    Ronzano, Francesco
    Mayer, Miguel A.
    JMIR CANCER, 2022, 8 (03):
  • [6] Predicting Hospital Readmission for Campylobacteriosis from Electronic Health Records: A Machine Learning and Text Mining Perspective
    Zhou, Shang-Ming
    Lyons, Ronan A.
    Rahman, Muhammad A.
    Holborow, Alexander
    Brophy, Sinead
    JOURNAL OF PERSONALIZED MEDICINE, 2022, 12 (01):
  • [7] Automatic identification of heart failure diagnostic criteria, using text analysis of clinical notes from electronic health records
    Byrd, Roy J.
    Steinhubl, Steven R.
    Sun, Jimeng
    Ebadollahi, Shahram
    Stewart, Walter F.
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2014, 83 (12) : 983 - 992
  • [8] Citywide quality of health information system through text mining of electronic health records
    Anastasia A. Funkner
    Michil P. Egorov
    Sergey A. Fokin
    Gennady M. Orlov
    Sergey V. Kovalchuk
    Applied Network Science, 6
  • [9] Citywide quality of health information system through text mining of electronic health records
    Funkner, Anastasia A.
    Egorov, Michil P.
    Fokin, Sergey A.
    Orlov, Gennady M.
    Kovalchuk, Sergey, V
    APPLIED NETWORK SCIENCE, 2021, 6 (01)
  • [10] Mining electronic health records: towards better research applications and clinical care
    Jensen, Peter B.
    Jensen, Lars J.
    Brunak, Soren
    NATURE REVIEWS GENETICS, 2012, 13 (06) : 395 - 405