Forecasting AIDS prevalence in the United States using online search traffic data

被引:22
作者
Mavragani A. [1 ]
Ochoa G. [1 ]
机构
[1] Department of Computing Science and Mathematics, Faculty of Natural Sciences, University of Stirling, Stirling
关键词
AIDS; Big data; Forecasting; Google Trends; HIV; Internet; Online behavior;
D O I
10.1186/s40537-018-0126-7
中图分类号
学科分类号
摘要
Over the past decade and with the increasing use of the Internet, the assessment of health issues using online search traffic data has become an integral part of Health Informatics. Internet data in general and from Google Trends in particular have been shown to be valid and valuable in predictions, forecastings, and nowcastings; and in detecting, tracking, and monitoring diseases’ outbreaks and epidemics. Empirical relationships have been shown to exist between Google Trends’ data and official data in several health topics, with the science of infodemiology using the vast amount of information available online for the assessment of public health and policy matters. The aim of this study is to provide a method of forecasting AIDS prevalence in the US using online search traffic data from Google Trends on AIDS related terms. The results at first show that significant correlations between Google Trends’ data and official health data on AIDS prevalence (2004–2015) exist in several States, while the estimated forecasting models for AIDS prevalence show that official health data and Google Trends data on AIDS follow a logarithmic relationship. Overall, the results of this study support previous work on the subject suggesting that Google data are valid and valuable for the analysis and forecasting of human behavior towards health topics, and could further assist with Health Assessment in the US and in other countries and regions with valid available official health data. © 2018, The Author(s).
引用
收藏
相关论文
共 42 条
  • [1] Hilbert M., Lopez P., The World’s technological capacity to store, communicate, and compute information, Science, 332, pp. 60-65, (2011)
  • [2] Chen C.L.P., Zhang C.Y., Data-intensive applications, challenges, techniques and technologies: a survey on big data, Inform Sci, 275, pp. 314-347, (2014)
  • [3] Al Nuaimi E., Al Neyadi H., Mohamed N., Al-Jaroodi J., Applications of big data to smart cities, J Int Serv App, 6, (2015)
  • [4] Matthew Herland M., Khoshgoftaar T.M., Wald R., A review of data mining using big data in health informatics, J Big Data, 1, (2014)
  • [5] Preis T., Moat H.S., Stanley H.E., Bishop S.R., Quantifying the advantage of looking forward, Sci Rep, 2, (2012)
  • [6] Preis T., Moat H.S., Stanley H.E., Quantifying trading behavior in financial markets using Google Trends, Sci Rep, 3, (2013)
  • [7] Google Trends
  • [8] Nuti S.V., Wayda B., Ranasinghei I., Wang S., Dreyer R.P., Chen S.I., Murugiah K., The use of Google Trends in health care research: a systematic review, PLoS ONE, 9, (2014)
  • [9] Alicino C., Bragazzi N.L., Faccio V., Amicizia D., Panatto D., Gasparini R., Icardi G., Orsi A., Assessing Ebola-related web search behaviour: insights and implications from an analytical study of Google Trends-based query volumes, Infect Dis Poverty, 4, 1, (2015)
  • [10] Hossain L., Kam D., Kong F., Wigand R.T., Bossomaier T., Social media in Ebola outbreak, Epidemiol Infect, 144, pp. 2136-2143, (2016)