Estimating the Total Volume of Queries to Google

被引:3
|
作者
Lillo, Fabrizio [1 ]
Ruggieri, Salvatore [2 ]
机构
[1] Univ Bologna, Bologna, Italy
[2] Univ Pisa, Pisa, Italy
来源
WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019) | 2019年
关键词
Search engine query; Volume estimation; Zipf's law; Google Trends; POWER-LAW DISTRIBUTIONS;
D O I
10.1145/3308558.3313535
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We study the problem of estimating the total volume of queries of a specific domain, which were submitted to the Google search engine in a given time period. Our statistical model assumes a Zipf's law distribution of the population in the reference domain, and a non-uniform or noisy sampling of queries. Parameters of the distribution are estimated using nonlinear least square regression. Estimations with errors are then derived for the total number of queries and for the total number of searches (volume). We apply the method on the recipes and cooking domain, where a sample of queries is collected by crawling popular Italian websites specialized on this domain. The relative volumes of queries in the sample are computed using Google Trends, and transformed to absolute frequencies after estimating a scaling factor. Our model estimates that the volume of Italian recipes and cooking queries submitted to Google in 2017 and with at least 10 monthly searches consists of 7.2B searches.
引用
收藏
页码:1051 / 1060
页数:10
相关论文
共 50 条
  • [1] Estimating the Total Volume of Queries to a Search Engine
    Lillo, Fabrizio
    Ruggieri, Salvatore
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (11) : 5351 - 5363
  • [2] Estimating total discharged volume in uncontrolled oil wells
    Liu, R.
    Kabir, C. S.
    Mannan, M. S.
    Hasan, A. R.
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2017, 156 : 373 - 380
  • [3] Estimating Total Traffic Volume with Statistical Modeling Approach
    Yong, Jiawei
    Wakabayashi, Yuichi
    Okayasu, Akihiro
    Miki, Reiji
    Sasai, Takeyuki
    Inoue, Masaaki
    Fukushima, Shintaro
    IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC, 2022, 2022-October : 304 - 309
  • [4] Estimating Total Traffic Volume with Statistical Modeling Approach
    Yong, Jiawei
    Wakabayashi, Yuichi
    Okayasu, Akihiro
    Miki, Reiji
    Sasai, Takeyuki
    Inoue, Masaaki
    Fukushima, Shintaro
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 304 - 309
  • [5] Sentiment, Google queries and explosivity in the cryptocurrency market
    Agosto, Arianna
    Cerchiello, Paola
    Pagnottoni, Paolo
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2022, 605
  • [6] Using WordNet Glosses to Refine Google Queries
    Nemrava, Jan
    DATESO 2006 - DATABASES, TEXTS, SPECIFICATIONS, OBJECTS: PROCEEDINGS OF THE 6TH ANNUAL INTERNATIONAL WORKSHOP, 2006, 176 : 85 - 94
  • [7] Evaluating Google queries based on language preferences
    Al-Eroud, Ahmed F.
    Al-Ramahi, Mohammad A.
    Al-Kabi, Mohammed N.
    Alsmadi, Izzat M.
    Al-Shawakfa, Emad M.
    JOURNAL OF INFORMATION SCIENCE, 2011, 37 (03) : 282 - 292
  • [8] Google Search Queries, Foreclosures, and House Prices
    Damianov, Damian S.
    Wang, Xiangdong
    Yan, Cheng
    JOURNAL OF REAL ESTATE FINANCE AND ECONOMICS, 2021, 63 (02): : 177 - 209
  • [9] Google Search Queries, Foreclosures, and House Prices
    Damian S. Damianov
    Xiangdong Wang
    Cheng Yan
    The Journal of Real Estate Finance and Economics, 2021, 63 : 177 - 209
  • [10] Volume queries in polyhedra
    Iacono, J
    Langerman, S
    DISCRETE AND COMPUTATIONAL GEOMETRY, 2001, 2098 : 156 - 159