Classifying publications from the clinical and translational science award program along the translational research spectrum: a machine learning approach

被引:31
|
作者
Surkis, Alisa [1 ]
Hogle, Janice A. [2 ]
DiazGranados, Deborah [3 ]
Hunt, Joe D. [4 ]
Mazmanian, Paul E. [3 ]
Connors, Emily [5 ]
Westaby, Kate [6 ]
Whipple, Elizabeth C. [7 ]
Adamus, Trisha [8 ]
Mueller, Meridith [9 ]
Aphinyanaphongs, Yindalon [10 ]
机构
[1] NYU, Sch Med, Hlth Sci Lib, New York, NY USA
[2] Univ Wisconsin, Inst Clin & Translat Res, Sch Med & Publ Hlth, Madison, WI USA
[3] Virginia Commonwealth Univ, Sch Med, Richmond, VA USA
[4] Indiana Univ Sch Med, Indiana Clin & Translat Sci Inst, Indianapolis, IN 46202 USA
[5] Med Coll Wisconsin, Clin & Translat Sci Inst, Milwaukee, WI 53226 USA
[6] Univ Wisconsin, Sch Med & Publ Hlth, Wisconsin Partnership Program, Madison, WI USA
[7] Indiana Univ Sch Med, Ruth Lilly Med Lib, Indianapolis, IN 46202 USA
[8] Univ Wisconsin, Sch Med & Publ Hlth, Ebling Lib Hlth Sci, Madison, WI USA
[9] Univ Wisconsin, Sch Med & Publ Hlth, Populat Hlth Sci, Madison, WI USA
[10] NYU, Sch Med, Dept Populat Hlth, New York, NY USA
关键词
Machine learning; Translational research; Knowledge translation; Text classification; HEALTH-CARE; CLASSIFICATION;
D O I
10.1186/s12967-016-0992-8
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Background: Translational research is a key area of focus of the National Institutes of Health (NIH), as demonstrated by the substantial investment in the Clinical and Translational Science Award (CTSA) program. The goal of the CTSA program is to accelerate the translation of discoveries from the bench to the bedside and into communities. Different classification systems have been used to capture the spectrum of basic to clinical to population health research, with substantial differences in the number of categories and their definitions. Evaluation of the effectiveness of the CTSA program and of translational research in general is hampered by the lack of rigor in these definitions and their application. This study adds rigor to the classification process by creating a checklist to evaluate publications across the translational spectrum and operationalizes these classifications by building machine learning-based text classifiers to categorize these publications. Methods: Based on collaboratively developed definitions, we created a detailed checklist for categories along the translational spectrum from T0 to T4. We applied the checklist to CTSA-linked publications to construct a set of coded publications for use in training machine learning-based text classifiers to classify publications within these categories. The training sets combined T1/T2 and T3/T4 categories due to low frequency of these publication types compared to the frequency of T0 publications. We then compared classifier performance across different algorithms and feature sets and applied the classifiers to all publications in PubMed indexed to CTSA grants. To validate the algorithm, we manually classified the articles with the top 100 scores from each classifier. Results: The definitions and checklist facilitated classification and resulted in good inter-rater reliability for coding publications for the training set. Very good performance was achieved for the classifiers as represented by the area under the receiver operating curves (AUC), with an AUC of 0.94 for the T0 classifier, 0.84 for T1/T2, and 0.92 for T3/T4. Conclusions: The combination of definitions agreed upon by five CTSA hubs, a checklist that facilitates more uniform definition interpretation, and algorithms that perform well in classifying publications along the translational spectrum provide a basis for establishing and applying uniform definitions of translational research categories. The classification algorithms allow publication analyses that would not be feasible with manual classification, such as assessing the distribution and trends of publications across the CTSA network and comparing the categories of publications and their citations to assess knowledge transfer across the translational research spectrum.
引用
收藏
页数:14
相关论文
共 12 条
  • [1] Classifying publications from the clinical and translational science award program along the translational research spectrum: a machine learning approach
    Alisa Surkis
    Janice A. Hogle
    Deborah DiazGranados
    Joe D. Hunt
    Paul E. Mazmanian
    Emily Connors
    Kate Westaby
    Elizabeth C. Whipple
    Trisha Adamus
    Meridith Mueller
    Yindalon Aphinyanaphongs
    Journal of Translational Medicine, 14
  • [2] Machine learning to promote translational research: predicting patent and clinical trial inclusion in dementia research
    Beinat, Matilda
    Beinat, Julian
    Shoaib, Mohammed
    Magenti, Jorge Gomez
    BRAIN COMMUNICATIONS, 2024, 6 (04)
  • [3] Developing the Translational Research Workforce: A Pilot Study of Common Metrics for Evaluating the Clinical and Translational Award KL2 Program
    Schneider, Margaret
    Guerrero, Lourdes
    Jones, Lisa B.
    Tong, Greg
    Ireland, Christine
    Dumbauld, Jill
    Rainwater, Julie
    CTS-CLINICAL AND TRANSLATIONAL SCIENCE, 2015, 8 (06): : 662 - 667
  • [4] Development of TRACER: A Translational Research Accomplishments Cataloguer for Clinical and Translational Science Award hub activity tracking, evaluation, and decision-making
    Sperling, Jessica
    Quenstedt, Stella
    Leiro, Anthony
    Muhigaba, Perusi B.
    McClernon, F. Joseph
    JOURNAL OF CLINICAL AND TRANSLATIONAL SCIENCE, 2024, 8 (01)
  • [5] An analysis of the Clinical and Translational Science Award pilot project portfolio using data from Research Performance Progress Reports
    Klein, Sean A.
    Baiocchi, Michael
    Rodu, Jordan
    Baker, Heather
    Rosemond, Erica
    Doyle, Jamie Mihoko
    JOURNAL OF CLINICAL AND TRANSLATIONAL SCIENCE, 2022, 6 (01)
  • [6] A Community Translational Research Pilot Grants Program to Facilitate Community-Academic Partnerships: Lessons From Colorado's Clinical Translational Science Awards
    Main, Deborah S.
    Felzien, Maret C.
    Magid, David J.
    Calonge, B. Ned
    O'Brien, Ruth A.
    Kempe, Allison
    Nearing, Kathryn
    PROGRESS IN COMMUNITY HEALTH PARTNERSHIPS-RESEARCH EDUCATION AND ACTION, 2012, 6 (03) : 381 - 387
  • [7] Evaluating Various Areas of Process Improvement in an Effort to Improve Clinical Research: Discussions from the 2012 Clinical Translational Science Award (CTSA) Clinical Research Management Workshop
    Strasser, Jane E.
    Cola, Philip A.
    Rosenblum, Daniel
    CTS-CLINICAL AND TRANSLATIONAL SCIENCE, 2013, 6 (04): : 317 - 320
  • [8] Machine learning and multi-omics integration: advancing cardiovascular translational research and clinical practice
    Lin, Mingzhi
    Guo, Jiuqi
    Gu, Zhilin
    Tang, Wenyi
    Tao, Hongqian
    You, Shilong
    Jia, Dalin
    Sun, Yingxian
    Jia, Pengyu
    JOURNAL OF TRANSLATIONAL MEDICINE, 2025, 23 (01)
  • [9] The Texas Health Resources Clinical Scholars Program: Learning healthcare system workforce development through embedded translational research
    Masica, Andrew L.
    Velasco, Ferdinand
    Nelson, Tanna L.
    Medford, Richard J.
    Hughes, Amy E.
    Pandey, Ambarish
    Peterson, Eric D.
    Lehmann, Christoph U.
    LEARNING HEALTH SYSTEMS, 2022, 6 (04):
  • [10] Re-engineering The Clinical Research Enterprise in Response to COVID-19: The Clinical Translational Science Award (CTSA) experience and proposed playbook for future pandemics
    Coller, Barry S.
    Buse, John B.
    Kimberly, Robert P.
    Powderly, William G.
    Zand, Martin S.
    JOURNAL OF CLINICAL AND TRANSLATIONAL SCIENCE, 2021, 5 (01)