Quantifying Sex Bias in Clinical Studies at Scale With Automated Data Extraction

被引:99
作者
Feldman, Sergey [1 ]
Ammar, Waleed [1 ]
Lo, Kyle [1 ]
Trepman, Elly [1 ,2 ,3 ]
van Zuylen, Madeleine [1 ]
Etzioni, Oren [1 ]
机构
[1] Allen Inst Artificial Intelligence, 2157 N Northlake Way,Ste 110, Seattle, WA 98103 USA
[2] Univ S Alabama, Coll Med, Mobile, AL USA
[3] Univ Manitoba, Dept Med Microbiol & Infect Dis, Winnipeg, MB, Canada
关键词
GENDER BIAS; TRIALS; WOMEN; REPRESENTATION; AGE; RACE/ETHNICITY; PARTICIPATION; ENROLLMENT; CARE;
D O I
10.1001/jamanetworkopen.2019.6700
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
IMPORTANCE Analyses of female representation in clinical studies have been limited in scope and scale. OBJECTIVE To perform a large-scale analysis of global enrollment sex bias in clinical studies. DESIGN, SETTING, AND PARTICIPANTS In this cross-sectional study, clinical studies from published articles from PubMed from 19 66 to 2018 and records from Aggregate Analysis of ClinicalTrials.gov from 19 99 to 2018 were identified. Global disease prevalence was determined for male and female patients in 11 disease categories from the Global Burden of Disease database: cardiovascular, diabetes, digestive, hepatitis (types A, B, C, and E), HIV/AIDS, kidney (chronic), mental, musculoskeletal, neoplasms, neurological, and respiratory (chronic). Machine reading algorithms were developed that extracted sex data from tables in articles and records on December 31, 2018, at an artificial intelligence research institute. Male and female participants in 43 135 articles (792 004 915 participants) and 13 165 records (12 977103 participants) were included. MAIN OUTCOMES AND MEASURES Sex bias was defined as the difference between the fraction of female participants in study participants minus prevalence fraction of female participants for each disease category. A total of 1000 bootstrap estimates of sex bias were computed by resampling individual studies with replacement. Sex bias was reported as mean and 95% bootstrap confidence intervals from articles and records in each disease category over time (before or during 19 93 to 2018), with studies or participants as the measurement unit. RESULTS There were 792 004 915 participants, including 390 470 834 female participants (49%), in articles and 12 977 103 participants, including 6 351 619 female participants (49%), in records. With studies as measurement unit, substantial female underrepresentation (sex bias <= -0.05) was observed in 7 of 11 disease categories, especially HIV/AIDS (mean for articles, -0.17 [95% CI, -018 to -0.16]), chronic kidney diseases (mean,-0.17 [95% CI,-0.17 to -0.16]), and cardiovascular diseases (mean, -0.14 [95% CI, -014 to -0.13]). Sex bias in articles for all categories combined was unchanged over time with studies as measurement unit (range, -0.15 [95% CI, -016 to -0.13] to -0.10 [95% CI, -0.14 to -0.06]), but improved from before or during 1993 (mean, -0.11[95% CI, -016 to -0.05]) to 2014 to 2018 (mean, -0.05 [95% CI, -0.09 to -0.02]) with participants as the measurement unit. Larger study size was associated with greater female representation. CONCLUSIONS AND RELEVANCE Automated extraction of the number of participants in clinical reports provides an effective alternative to manual analysis of demographic bias. Despite legal and policy initiatives to increase female representation, sex bias against female participants in clinical studies persists. Studies with more participants have greater female representation. Differences between sex bias estimates with studies vs participants as measurement unit, and between articles vs records, suggest that sex bias with both measures and data sources should be reported.
引用
收藏
页数:14
相关论文
共 46 条
  • [1] Allen Institute for Artificial Intelligence, SEM SCHOL
  • [2] [Anonymous], 2013, BMJ BRIT MED J, DOI DOI 10.1136/BMJ.F2865
  • [3] Machine Reading of Biomedical Data Dictionaries
    Ashish, Naveen
    Patawari, Arihant
    [J]. ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2018, 9 (04):
  • [4] Barnish MS, 2017, PRAGMAT OBS RES, V8, P49, DOI 10.2147/POR.S137701
  • [5] Bhagavatula C, 2018, 2018 C N AM CHAPT AS
  • [6] Choi R., 2015, Washington University Global Studies Law Review, V14, P521, DOI DOI 10.3403/30218727
  • [7] DARPA's Big Mechanism program
    Cohen, Paul R.
    [J]. PHYSICAL BIOLOGY, 2015, 12 (04)
  • [8] Observational Studies to Contextualize Surgical Trials
    Cole, Alexander P.
    Abdollah, Firas
    Quoc-Dien Trinh
    [J]. EUROPEAN UROLOGY, 2016, 70 (02) : 231 - 232
  • [9] Why Observational Studies Should Be Among The Tools Used In Comparative Effectiveness Research
    Dreyer, Nancy A.
    Tunis, Sean R.
    Berger, Marc
    Ollendorf, Dan
    Mattox, Pattra
    Gliklich, Richard
    [J]. HEALTH AFFAIRS, 2010, 29 (10) : 1818 - 1825
  • [10] Etzioni O., 2006, AAAI, V6, P1517, DOI [DOI 10.1145/1298406.1298407, 10.1145/1298406.1298407]