Web-based Startup Success Prediction

被引:26
作者
Sharchilev, Boris [1 ]
Roizner, Michael [1 ]
Rumyantsev, Andrey [1 ]
Ozornin, Denis [1 ]
Serdyukov, Pavel [1 ]
de Rijke, Maarten [2 ]
机构
[1] Yandex, Moscow, Russia
[2] Univ Amsterdam, Amsterdam, Netherlands
来源
CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT | 2018年
关键词
Predictive modeling; Heterogeneous web data; Mining open sources; Gradient boosting; BUSINESS; ANGEL;
D O I
10.1145/3269206.3272011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of predicting the success of startup companies at their early development stages. We formulate the task as predicting whether a company that has already secured initial (seed or angel) funding will attract a further round of investment in a given period of time. Previous work on this task has mostly been restricted to mining structured data sources, such as databases of the startup ecosystem consisting of investors, incubators and startups. Instead, we investigate the potential of using web-based open sources for the startup success prediction task and model the task using a very rich set of signals from such sources. In particular, we enrich structured data about the startup ecosystem with information from a business-and employment-oriented social networking service and from the web in general. Using these signals, we train a robust machine learning pipeline encompassing multiple base models using gradient boosting. We show that utilizing companies' mentions on the Web yields a substantial performance boost in comparison to only using structured data about the startup ecosystem. We also provide a thorough analysis of the obtained model that allows one to obtain insights into both the types of useful signals discoverable on the Web and market mechanisms underlying the funding process.
引用
收藏
页码:2283 / 2291
页数:9
相关论文
共 31 条
  • [1] DIFFERENTIAL INFLUENCE OF BLOGS ACROSS DIFFERENT STAGES OF DECISION MAKING: THE CASE OF VENTURE CAPITALISTS
    Aggarwal, Rohit
    Singh, Harpreet
    [J]. MIS QUARTERLY, 2013, 37 (04) : 1093 - +
  • [2] [Anonymous], 2015, ARXIV PREPRINT ARXIV
  • [3] [Anonymous], REV FINANCIAL STUDIE
  • [4] [Anonymous], 1998, Online Algorithms and Stochastic Approximations
  • [5] BIGGADIKE R, 1979, HARVARD BUS REV, V57, P103
  • [6] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [7] CatBoost, 2017, GRAD BOOST DEC TREES
  • [8] CatBoost, 2017, REG FEAT IMP
  • [9] Coyle J. F., 2012, DUKE LAW J, V63
  • [10] Venture capital financing and the growth of startup firms
    Davila, A
    Foster, G
    Gupta, M
    [J]. JOURNAL OF BUSINESS VENTURING, 2003, 18 (06) : 689 - 708