Can internet search engine queries be used to diagnose diabetes? Analysis of archival search data

被引:16
|
作者
Hochberg, Irit [1 ]
Daoud, Deeb [1 ]
Shehadeh, Naim [1 ,2 ]
Yom-Tov, Elad [3 ]
机构
[1] Inst Endocrinol Diabet & Metab, Rambam Hlth Care Campus, 8 Haaliya St,POB 9602, IL-31096 Haifa, Israel
[2] Technion Israel Inst Technol, Bruce Rappaport Fac Med, Haifa, Israel
[3] Microsoft Res, Herzliyya, Israel
关键词
Diabetes; Symptoms; Digital health; Internet;
D O I
10.1007/s00592-019-01350-5
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Aims Diabetes is often diagnosed late. This study aimed to assess the possibility for earlier detection of diabetes from search data, using predictive models trained on large-scale data. Methods We extracted all English-language queries made by people in the USA to Bing during 1 year and identified queries containing symptoms of diabetes. We compared the ability of four different prediction models (linear regression, logistic regression, decision tree and random forest) to distinguish between users who stated that they were diagnosed with diabetes and users who did not refer to diabetes or diabetes drugs but queried about at least one of the symptoms. Results We identified 11,050 "new diabetes users" who stated they had been diagnosed with diabetes and approximately 11.5 million "control users" who queried about symptoms without querying for terms related to diabetes. Both the logistic regression and the random forest models were able to distinguish between the populations with an area under curve of 0.92 which translates to a positive predictive value of 56% at a false-positive rate of 1%. The model could identify patients up to 240 days before they mentioned being diagnosed. Conclusions Some undiagnosed diabetes patients can be detected accurately according to their symptom queries to a search engine. Such earlier diagnosis, especially in cases of type 1 diabetes, could be clinically meaningful. The ability of search engines to serve as a population-wide screening tool could potentially be improved using additional data provided by users.
引用
收藏
页码:1149 / 1154
页数:6
相关论文
共 50 条
  • [21] Internet search engine update
    2001, Online Inc. (25):
  • [22] Internet search engine update
    Notess, GR
    ONLINE, 2000, 24 (02): : 18 - 19
  • [23] Internet search engine update
    Notess, Greg R.
    Online (Wilton, Connecticut), 2002, 26 (03):
  • [24] Internet search engine update
    Online (Wilton, Conn), 5 (13):
  • [25] Internet search engine update
    Notess, G.R.
    2001, Online Inc. (25):
  • [26] Internet search engine update
    Notess, G.R.
    Online (Wilton, Connecticut), 2002, 26 (01):
  • [27] Nongraphic Internet search engine
    不详
    JOURNAL OF VISUAL IMPAIRMENT & BLINDNESS, 1997, 91 (06) : 11 - 12
  • [28] Internet search engine update
    Notess, GR
    ONLINE, 1999, 23 (04): : 16 - 16
  • [29] Internet search engine update
    Notess, GR
    ONLINE, 1999, 23 (05): : 14 - 14
  • [30] Search on the Web with spatial criterions -: Improving a Search Engine with spatial queries
    Corcoles, Jose E.
    Gonzalez, Pascual
    Rodriguez, Marcos
    WEBIST 2007: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, VOL WIA: WEB INTERFACES AND APPLICATIONS, 2007, : 382 - +