Automatic identification of high impact articles in PubMed to support clinical decision making

被引:9
作者
Bian, Jiantao [1 ]
Morid, Mohammad Amin [2 ]
Jonnalagadda, Siddhartha [3 ]
Luo, Gang [4 ]
Del Fiol, Guilherme [1 ]
机构
[1] Univ Utah, Dept Biomed Informat, 421 Wakara Way, Salt Lake City, UT 84108 USA
[2] Univ Utah, David Eccles Sch Business, Dept Operat & Informat Syst, Salt Lake City, UT 84108 USA
[3] Microsoft Corp, One Microsoft Way, Redmond, WA 98052 USA
[4] Univ Washington, Dept Biomed Informat & Med Educ, Seattle, WA 98195 USA
关键词
QUESTIONS; RETRIEVAL; QUALITY; MANAGEMENT; GUIDELINE; QUERIES; NEEDS;
D O I
10.1016/j.jbi.2017.07.015
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objectives: The practice of evidence-based medicine involves integrating the latest best available evidence into patient care decisions. Yet, critical barriers exist for clinicians' retrieval of evidence that is relevant for a particular patient from primary sources such as randomized controlled trials and meta analyses. To help address those barriers, we investigated machine learning algorithms that find clinical studies with high clinical impact from PubMed (R). Methods: Our machine learning algorithms use a variety of features including bibliometric features (e.g., citation count), social media attention, journal impact factors, and citation metadata. The algorithms were developed and evaluated with a gold standard composed of 502 high impact clinical studies that are referenced in 11 clinical evidence-based guidelines on the treatment of various diseases. We tested the following hypotheses: (1) our high impact classifier outperforms a state-of-the-art classifier based on citation metadata and citation terms, and PubMed's (R) relevance sort algorithm; and (2) the performance of our high impact classifier does not decrease significantly after removing proprietary features such as citation count. Results: The mean top 20 precision of our high impact classifier was 34% versus 11% for the state-of-the-art classifier and 4% for PubMed's (R) relevance sort (p = 0.009); and the performance of our high impact classifier did not decrease significantly after removing proprietary features (mean top 20 precision = 34% vs. 36%; p = 0.085). Conclusion: The high impact classifier, using features such as bibliometrics, social media attention and MEDLINE (R) metadata, outperformed previous approaches and is a promising alternative to identifying high impact studies for clinical decision support. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:95 / 103
页数:9
相关论文
共 62 条
  • [1] Almenoff P., 2007, VA/DoD clinical practice guideline for management of outpatient COPD, P138
  • [2] [Anonymous], 1999, STAT MED, V18, P1905
  • [3] [Anonymous], 1970, NEW ENGL J MED, V282, P220
  • [4] [Anonymous], 1996, J CAN CHIROPR ASS, V40, P47
  • [5] Text categorization models for high-quality article retrieval in internal medicine
    Aphinyanaphongs, Y
    Tsamardinos, I
    Statnikov, A
    Hardin, D
    Aliferis, CF
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2005, 12 (02) : 207 - 216
  • [6] A comparison of citation metrics to machine learning filters for the identification of high quality MEDLINE documents
    Aphinyanaphongs, Yindalon
    Statnikov, Alexander
    Aliferis, Constantin F.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2006, 13 (04) : 446 - 455
  • [7] An Analysis of Altmetrics in Emergency Medicine
    Barbic, David
    Tubman, Michelle
    Lam, Henry
    Barbic, Skye
    [J]. ACADEMIC EMERGENCY MEDICINE, 2016, 23 (03) : 251 - 268
  • [8] Using citation data to improve retrieval from MEDLINE
    Bernstam, EV
    Herskovic, JR
    Aphinyanaphongs, Y
    Aliferis, CF
    Sriram, MG
    Hersh, WR
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2006, 13 (01) : 96 - 105
  • [9] What do citation counts measure? A review of studies on citing behavior
    Bornmann, Luti
    Daniel, Hans-Dieter
    [J]. JOURNAL OF DOCUMENTATION, 2008, 64 (01) : 45 - 80
  • [10] The anatomy of a large-scale hypertextual Web search engine
    Brin, S
    Page, L
    [J]. COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7): : 107 - 117