Machine learning and statistical models for analyzing multilevel patent data

被引:0
|
作者
Sunyun Qi
Yu Zhang
Hua Gu
Fei Zhu
Meiying Gao
Hongxiao Liang
Qifeng Zhang
Yanchao Gao
机构
[1] Zhejiang Provincial Center for Medical Science Technology and Education Development,Leuven Statistics Research Centre, Faculty of Science
[2] KU Leuven (Katholieke Universiteit Leuven),Department of Public Utilities Management, Faculty of Humanities and Management
[3] Zhejiang Chinese Medical University,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
A recent surge of patent applications among public hospitals in China has aroused significant research interest. A country’s healthcare innovation capacity can be measured by its number of patents. This paper explores the link between the number of patents and ten independent variables. Multicollinearity was carefully detected and removed by using the variable selection method and LASSO regression, respectively. The Poisson model and the negative binomial model were proposed to analyze the patent data. Three goodness of fit tests, the Pearson test, the deviance test, and the DHARMa non-parametric dispersion test, were conducted to investigate if the model has a good fit. After discovering four clusters by conducting agglomerative hierarchical clustering, these two models were replaced by the negative binomial mixed model. The likelihood ratio test was used to determine which model is more appropriate and the results reveal that the negative binomial mixed model outperforms both the Poisson model and the negative binomial model. Three variables, number of health technicians per 10,000 people, financial expenditure on science and technology as well as number of patent applications per 10,000 health personnel, have a significantly positive relationship with the number of patents in Chinese tertiary public hospitals.
引用
收藏
相关论文
共 50 条
  • [41] A Bayesian perspective of statistical machine learning for big data
    Sambasivan, Rajiv
    Das, Sourish
    Sahu, Sujit K.
    COMPUTATIONAL STATISTICS, 2020, 35 (03) : 893 - 930
  • [42] Data science and machine learning: Mathematical and statistical methods
    Lai, Yin-Ju
    Hsiao, Chuhsing Kate
    Botev, Zdravko
    BIOMETRICS, 2021, 77 (04) : 1503 - 1504
  • [43] Random logistic machine (RLM): Transforming statistical models into machine learning approach
    Li, Yu-Shan
    Guo, Chao-Yu
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2024, 53 (21) : 7517 - 7525
  • [44] Data Acquisition for Improving Machine Learning Models
    Li, Yifan
    Yu, Xiaohui
    Koudas, Nick
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (10): : 1832 - 1844
  • [45] An approach for analyzing urban carbon emissions using machine learning models
    Gao, Peidao
    Zhu, Chaoyong
    Zhang, Yang
    Chen, Bo
    INDOOR AND BUILT ENVIRONMENT, 2023, 32 (08) : 1657 - 1667
  • [46] Analyzing Machine Learning Models with Gaussian Process for the Indoor Positioning System
    Xie, Yunxin
    Zhu, Chenyang
    Jiang, Wei
    Bi, Jia
    Zhu, Zhengwei
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [47] Analyzing machine learning models to accelerate generation of fundamental materials insights
    Mitsutaro Umehara
    Helge S. Stein
    Dan Guevarra
    Paul F. Newhouse
    David A. Boyd
    John M. Gregoire
    npj Computational Materials, 5
  • [48] Analyzing machine learning models to accelerate generation of fundamental materials insights
    Umehara, Mitsutaro
    Stein, Helge S.
    Guevarra, Dan
    Newhouse, Paul F.
    Boyd, David A.
    Gregoire, John M.
    NPJ COMPUTATIONAL MATERIALS, 2019, 5 (1)
  • [49] Analyzing Credit Card Fraud Detection based on Machine Learning Models
    Almutairi, Raghad
    Godavarthi, Abhishek
    Kotha, Arthi Reddy
    Ceesay, Ebrima
    2022 IEEE INTERNATIONAL IOT, ELECTRONICS AND MECHATRONICS CONFERENCE (IEMTRONICS), 2022, : 988 - 995
  • [50] Integration of machine learning and statistical models for crash frequency modeling
    Zhou, Dongqin
    Gayah, Vikash V.
    Wood, Jonathan S.
    TRANSPORTATION LETTERS-THE INTERNATIONAL JOURNAL OF TRANSPORTATION RESEARCH, 2023, 15 (10): : 1408 - 1419