Hey Google! will New Zealand vote to legalise cannabis? Using Google Trends data to predict the outcome of the 2020 New Zealand cannabis referendum

被引:8
作者
Raubenheimer, Jacques Eugene [1 ]
Riordan, Benjamin C. [2 ]
Merrill, Jennifer E. [3 ]
Winter, Taylor [4 ]
Ward, Rose Marie [5 ]
Scarf, Damian [6 ]
Buckley, Nicholas A. [1 ]
机构
[1] Univ Sydney, Discipline Biomed Informat & Digital Hlth, Sydney, NSW, Australia
[2] La Trobe Univ, Ctr Alcohol Policy Res CAPR, Melbourne, Vic, Australia
[3] Brown Univ, Ctr Alcohol & Addict Studies, Sch Publ Hlth, Providence, RI USA
[4] Victoria Univ Wellington, Sch Psychol, Wellington, New Zealand
[5] Miami Univ, Kinesiol & Hlth, Oxford, OH 45056 USA
[6] Univ Otago, Dept Psychol, Dunedin, New Zealand
基金
英国医学研究理事会;
关键词
Google Trends; Google Trends extended for health; Cannabis; Cannabis legalisation; Referendum; New Zealand; ACCURACY; TWITTER; POLLS;
D O I
10.1016/j.drugpo.2020.103083
中图分类号
R194 [卫生标准、卫生检查、医药管理];
学科分类号
摘要
Background: New Zealand held a referendum on the legalisation of recreational cannabis in October 2020. Polls preceding the referendum provided contrasting outcomes. We investigated whether internet search data from Google Trends could provide an alternative estimate of the referendum outcome. Methods: We assessed various methods for accessing Google Trends data, downloading search probability data for google.com searches from New Zealand via trends.google.com, PyTrends and Google Trends Extended for Health. We used daily data for the three months prior to the final referendum date, and hourly data for the final week. We defined two smaller time frames each from daily and hourly data, allowing comparisons over the entire time frames, and progressively closer to the end. Using the selected keyword combination of 'cannabis referendum yes/no' we calculated the proportions of 'yes' and 'no' searches for each time frame/data source combination, aiming for a prediction within 2% of the final result. Results: Data from different sources varied slightly. The method used to aggregate search probabilities over the selected time frame (mean/median) resulted in changes in the predicted outcome for hourly-, but not daily data. On 20 October we predicted the 'no' vote at 51.9%-55.4% for daily-, and 60% for hourly data when aggregated using the median, but only 49% for mean hourly data. Hourly data performed poorly at predicting the final 51.2% 'no' result, while predictions based on mean daily data for the full voting period provided the best prediction, differing by 0.1-0.2%. Conclusion: Predictions based on Google Trends data broadly agreed with polling predictions, but the exact method used affected the eventual prediction. While polls are subject to influence from methodological considerations (e.g., sampling), it is clear that Google Trends data can be used to make a prediction, but do not present a magic bullet solution to polling problems.
引用
收藏
页数:10
相关论文
共 75 条
  • [1] Al-Imam A., 2018, REAL TIME INFE UNPUB, DOI [10.20944/PREPRINTS201811.0260.V2, DOI 10.20944/PREPRINTS201811.0260.V2]
  • [2] What the fake? Assessing the extent of networked political spamming and bots in the propagation of #fakenews on Twitter
    Al-Rawi, Ahmed
    Groshek, Jacob
    Zhang, Li
    [J]. ONLINE INFORMATION REVIEW, 2019, 43 (01) : 53 - 71
  • [3] [Anonymous], 2015, HANDS ON GUIDE GOOGL
  • [4] Google Trends: Opportunities and limitations in health and health policy research
    Arora, Vishal S.
    McKee, Martin
    Stuckler, David
    [J]. HEALTH POLICY, 2019, 123 (03) : 338 - 341
  • [5] Bermingham A., 2011, P WORKSH SENT AN AI, P2
  • [6] Prediction and analysis of Indonesia Presidential election from Twitter using sentiment analysis
    Budiharto, Widodo
    Meiliana, Meiliana
    [J]. JOURNAL OF BIG DATA, 2018, 5 (01)
  • [7] 140 characters to victory?: Using Twitter to predict the UK 2015 General Election
    Burnap, Pete
    Gibson, Rachel
    Sloan, Luke
    Southern, Rosalynd
    Williams, Matthew
    [J]. ELECTORAL STUDIES, 2016, 41 : 230 - 233
  • [8] THE DARK SIDE OF PUBLISHING
    Butler, Declan
    [J]. NATURE, 2013, 495 (7442) : 433 - 435
  • [9] Predicting the Present with Google Trends
    Choi, Hyunyoung
    Varian, Hal
    [J]. ECONOMIC RECORD, 2012, 88 : 2 - 9
  • [10] Chu B, 2020, ECON BULL, V40