Computational phenotyping with the All of Us Research Program: identifying underrepresented people with HIV or at risk of HIV

Cited by: 2
Authors
Yang, Xueying [1, 2, 6]
Zhang, Jiajia [1, 3]
Cai, Ruilie [1]
Liang, Chen [1, 4]
Olatosi, Bankole [1, 4]
Weissman, Sharon [1, 5]
Li, Xiaoming [1, 2]
Affiliations
[1] Univ South Carolina, South Carolina SmartState Ctr Healthcare Qual, Arnold Sch Publ Hlth, Columbia, SC 29208 USA
[2] Univ South Carolina, Arnold Sch Publ Hlth, Dept Hlth Promot Educ & Behav, Columbia, SC 29208 USA
[3] Univ South Carolina, Arnold Sch Publ Hlth, Dept Epidemiol & Biostat, Columbia, SC 29208 USA
[4] Univ South Carolina, Arnold Sch Publ Hlth, Dept Hlth Serv Policy & Management, Columbia, SC 29208 USA
[5] Univ South Carolina, Sch Med, Dept Internal Med, Columbia, SC 29208 USA
[6] Univ South Carolina, South Carolina SmartState Ctr Healthcare Qual, Arnold Sch Publ Hlth, Dept Hlth Promot, 915 Greene St,Discover I Suite 534B, Columbia, SC 29208 USA
Keywords
HIV/AIDS; phenotyping; All of Us; underrepresented
DOI
10.1093/jamiaopen/ooad071
Chinese Library Classification
R19 [Health care organizations and services (health services administration)]
Abstract
Objective: This study aims to identify people living with HIV (PWH) and pre-exposure prophylaxis (PrEP) users in the All of Us (AoU) database by integrating information from electronic health record (EHR) data and self-reported survey data.
Methods: We classified participants as PWH or PrEP users if they met the inclusion criteria based on HIV-related conditions, lab measurements, or medications in the EHR data, or on confirmatory questions in the survey data.
Results: We evaluated the latest AoU data release, through July 1, 2022. Through computational phenotyping, we identified 4575 confirmed and 3092 probable adult PWH and 564 PrEP users. PWH were most often identified by a combination of medications and conditions (3324, 43.4%) or by drug exposure alone (2191, 28.6%), and less commonly by survey data alone (608, 7.9%) or lab results alone (81, 1.1%).
Discussion and conclusion: Our methods serve as a general framework for other researchers using AoU data to conduct HIV-related research.
LAY SUMMARY
Electronic health record (EHR) data comprise administrative and billing data, electronic medical records, and other digital records of information pertinent to individual or population health. In this study, we used EHR data and survey data from the All of Us (AoU) platform to identify potential people with HIV (PWH) and individuals taking pre-exposure prophylaxis (PrEP) for HIV prevention. The AoU Research Program aims to recruit participants from groups that have been historically underrepresented in biomedical research. Using information from different domains of EHR data (ie, diagnostic codes, drug prescription records, and lab results) and questions in the survey, we conducted computational phenotyping (ie, the process of transforming noisy, massive EHR data into meaningful medical concepts that can be used to detect cases or predict an individual's risk of disease) and identified 4575 confirmed and 3092 probable adult PWH and 564 PrEP users. PWH were most often identified by a combination of medications and conditions or by drug exposure alone, and less commonly by survey data alone or lab results alone. Our methods for HIV and PrEP case detection could help other researchers using AoU data to conduct HIV-related research.
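The multi-domain identification described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the abstract does not give the rules that separate confirmed from probable PWH, so the sketch only labels each participant by the evidence domains that flagged them and tabulates the share of each label. The `Evidence` class, its field names, and the helper functions are hypothetical names introduced for illustration.

```python
from collections import Counter
from dataclasses import dataclass

# Hypothetical evidence domains, mirroring those named in the abstract.
DOMAINS = ("condition", "lab", "medication", "survey")


@dataclass(frozen=True)
class Evidence:
    """Per-participant flags: does each domain contain HIV-related evidence?"""
    condition: bool = False
    lab: bool = False
    medication: bool = False
    survey: bool = False


def evidence_label(e: Evidence) -> str:
    """Name the combination of domains that flagged a participant."""
    hits = [name for name in DOMAINS if getattr(e, name)]
    return " + ".join(hits) if hits else "none"


def summarize(cohort: dict[str, Evidence]) -> dict[str, tuple[int, float]]:
    """Count identified participants per evidence label, with percentages.

    Participants with no evidence in any domain are excluded from the
    denominator (an assumption of this sketch).
    """
    counts = Counter(evidence_label(e) for e in cohort.values())
    counts.pop("none", None)
    total = sum(counts.values())
    return {label: (n, round(100 * n / total, 1)) for label, n in counts.items()}


# Toy data, invented for illustration only.
toy_cohort = {
    "p1": Evidence(condition=True, medication=True),
    "p2": Evidence(medication=True),
    "p3": Evidence(survey=True),
    "p4": Evidence(),  # no HIV-related evidence; not counted
    "p5": Evidence(condition=True, medication=True),
}
summary = summarize(toy_cohort)
```

As a consistency check on the reported figures, the percentages in the abstract match a denominator of 4575 + 3092 = 7667 identified PWH; for example, 3324 / 7667 rounds to 43.4%.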
Pages: 6