Machine learning and statistical models for analyzing multilevel patent data

被引:0
|
作者
Sunyun Qi
Yu Zhang
Hua Gu
Fei Zhu
Meiying Gao
Hongxiao Liang
Qifeng Zhang
Yanchao Gao
机构
[1] Zhejiang Provincial Center for Medical Science Technology and Education Development,Leuven Statistics Research Centre, Faculty of Science
[2] KU Leuven (Katholieke Universiteit Leuven),Department of Public Utilities Management, Faculty of Humanities and Management
[3] Zhejiang Chinese Medical University,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
A recent surge of patent applications among public hospitals in China has aroused significant research interest. A country’s healthcare innovation capacity can be measured by its number of patents. This paper explores the link between the number of patents and ten independent variables. Multicollinearity was carefully detected and removed by using the variable selection method and LASSO regression, respectively. The Poisson model and the negative binomial model were proposed to analyze the patent data. Three goodness of fit tests, the Pearson test, the deviance test, and the DHARMa non-parametric dispersion test, were conducted to investigate if the model has a good fit. After discovering four clusters by conducting agglomerative hierarchical clustering, these two models were replaced by the negative binomial mixed model. The likelihood ratio test was used to determine which model is more appropriate and the results reveal that the negative binomial mixed model outperforms both the Poisson model and the negative binomial model. Three variables, number of health technicians per 10,000 people, financial expenditure on science and technology as well as number of patent applications per 10,000 health personnel, have a significantly positive relationship with the number of patents in Chinese tertiary public hospitals.
引用
收藏
相关论文
共 50 条
  • [21] Machine Learning and Statistical Models to Predict Postpartum Hemorrhage
    Young, Roger C.
    OBSTETRICS AND GYNECOLOGY, 2020, 136 (01): : 194 - 195
  • [22] Assessing Hydrological Simulations with Machine Learning and Statistical Models
    Rozos, Evangelos
    HYDROLOGY, 2023, 10 (02)
  • [23] Machine Learning: Deepest Learning as Statistical Data Assimilation Problems
    Abarbanel, Henry D., I
    Rozdeba, Paul J.
    Shirman, Sasha
    NEURAL COMPUTATION, 2018, 30 (08) : 2025 - 2055
  • [24] Statistical and machine learning models for predicting spalling in CRCP
    Al-Khateeb, Ghazi G.
    Alnaqbi, Ali
    Zeiada, Waleed
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [25] Machine Learning and Statistical Models to Predict Postpartum Hemorrhage
    Venkatesh, Kartik K.
    Strauss, Robert A.
    Grotegut, Chad A.
    Heine, R. Philip
    Chescheir, Nancy C.
    Stringer, Jeffrey S. A.
    Stamilio, David M.
    Menard, Katherine M.
    Jelovsek, J. Eric
    OBSTETRICS AND GYNECOLOGY, 2020, 135 (04): : 935 - 944
  • [26] FORECASTING UNDER APPLYING MACHINE LEARNING AND STATISTICAL MODELS
    Elhag, Azhari A.
    Abu-Zinadah, Hanaa
    Elhag, Azhari A. (a.alhag@tu.edu.sa), 1600, Serbian Society of Heat Transfer Engineers (24):
  • [27] FORECASTING UNDER APPLYING MACHINE LEARNING AND STATISTICAL MODELS
    Elhag, Azhari A.
    Abu-Zinadah, Hanaa
    THERMAL SCIENCE, 2020, 24 : S131 - S137
  • [28] Statistical quantification of confounding bias in machine learning models
    Spisak, Tamas
    GIGASCIENCE, 2022, 11
  • [29] Machine-learning models for analyzing TSOM images of nanostructures
    Qu, Yufu
    Hao, Jialin
    Peng, Renju
    OPTICS EXPRESS, 2019, 27 (23) : 33979 - 33999
  • [30] Analyzing Effective Factors of Online Learning Performance by Interpreting Machine Learning Models
    Xiao, Wen
    Hu, Juan
    IEEE ACCESS, 2023, 11 : 132435 - 132447