Predicting innovative firms using web mining and deep learning

被引:28
|
作者
Kinne, Jan [1 ,2 ,3 ]
Lenz, David [3 ,4 ]
机构
[1] ZEW Ctr European Econ Res, Dept Econ Innovat & Ind Dynam, Mannheim, Germany
[2] Univ Salzburg, Dept Geoinformat Z GIS, Salzburg, Austria
[3] Istari Ai, Mannheim, Germany
[4] Justus Liebig Univ, Dept Econometr & Stat, Giessen, Germany
来源
PLOS ONE | 2021年 / 16卷 / 04期
关键词
PATENT STATISTICS; NEURAL-NETWORKS;
D O I
10.1371/journal.pone.0249071
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Evidence-based STI (science, technology, and innovation) policy making requires accurate indicators of innovation in order to promote economic growth. However, traditional indicators from patents and questionnaire-based surveys often lack coverage, granularity as well as timeliness and may involve high data collection costs, especially when conducted at a large scale. Consequently, they struggle to provide policy makers and scientists with the full picture of the current state of the innovation system. In this paper, we propose a first approach on generating web-based innovation indicators which may have the potential to overcome some of the shortcomings of traditional indicators. Specifically, we develop a method to identify product innovator firms at a large scale and very low costs. We use traditional firm-level indicators from a questionnaire-based innovation survey (German Community Innovation Survey) to train an artificial neural network classification model on labelled (product innovator/no product innovator) web texts of surveyed firms. Subsequently, we apply this classification model to the web texts of hundreds of thousands of firms in Germany to predict whether they are product innovators or not. We then compare these predictions to firm-level patent statistics, survey extrapolation benchmark data, and regional innovation indicators. The results show that our approach produces reliable predictions and has the potential to be a valuable and highly cost-efficient addition to the existing set of innovation indicators, especially due to its coverage and regional granularity.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Predicting the Landscape of Recombination Using Deep Learning
    Adrion, Jeffrey R.
    Galloway, Jared G.
    Kern, Andrew D.
    MOLECULAR BIOLOGY AND EVOLUTION, 2020, 37 (06) : 1790 - 1808
  • [2] Predicting process behaviour using deep learning
    Evermann, Joerg
    Rehse, Jana-Rebecca
    Fettke, Peter
    DECISION SUPPORT SYSTEMS, 2017, 100 : 129 - 140
  • [3] Intruder Detection Using Deep Learning and Association Rule Mining
    Thilina, Asantha
    Attanayake, Shakthi
    Samarakoon, Sacith
    Nawodya, Dahami
    Rupasinghe, Lakmal
    Pathirage, Nadith
    Edirisinghe, Tharindu
    Krishnadeva, Kesavan
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2016, : 615 - 620
  • [4] Predicting carbon peak at the provincial level using deep learning
    Tang, Xiaoyan
    Fang, Kunsheng
    ENVIRONMENTAL RESEARCH COMMUNICATIONS, 2025, 7 (02):
  • [5] Method of predicting human mobility patterns using deep learning
    Kim, Dong Yup
    Song, Ha Yoon
    NEUROCOMPUTING, 2018, 280 : 56 - 64
  • [6] Predicting NEPSE index price using deep learning models
    Pokhrel, Nawa Raj
    Dahal, Keshab Raj
    Rimal, Ramchandra
    Bhandari, Hum Nath
    Khatri, Rajendra K. C.
    Rimal, Binod
    Hahn, William Edward
    MACHINE LEARNING WITH APPLICATIONS, 2022, 9
  • [7] Web Application Firewall Based on Anomaly Detection using Deep Learning
    Toprak, Sezer
    Yavuz, Ali Gokhan
    ACTA INFOLOGICA, 2022, 6 (02): : 219 - 244
  • [8] Predicting distresses using deep learning of text segments in annual reports
    Matin, Rastin
    Hansen, Casper
    Hansen, Christian
    Molgaard, Pia
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 132 : 199 - 208
  • [9] On the platform but will they buy? Predicting customers' purchase behavior using deep learning
    Chaudhuri, Neha
    Gupta, Gaurav
    Vamsi, Vallurupalli
    Bose, Indranil
    DECISION SUPPORT SYSTEMS, 2021, 149
  • [10] Predicting systemic risk in financial systems using Deep Graph Learning
    Balmaseda, Vicente
    Coronado, Maria
    de Cadenas-Santiago, Gonzalo
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 19