In Search of New Product Ideas: Identifying Ideas in Online Communities by Machine Learning and Text Mining

被引:67
作者
Christensen, Kasper [1 ]
Norskov, Sladjana [2 ]
Frederiksen, Lars [2 ]
Scholderer, Joachim [3 ]
机构
[1] Norwegian Univ Life Sci, Dept Math Sci & Technol, As, Norway
[2] Aarhus Univ, Dept Management, Aarhus, Denmark
[3] Aarhus Univ, Dept Econ & Business Econ, Aarhus, Denmark
关键词
ANALYSIS MATRIX; INNOVATION; USERS; CLASSIFICATION; AGREEMENT; QUALITY; MODEL;
D O I
10.1111/caim.12202
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Online communities are attractive sources of ideas relevant for new product development and innovation. However, making sense of the big data' in these communities is a complex analytical task. A systematic way of dealing with these data is needed to exploit their potential for boosting companies' innovation performance. We propose a method for analysing online community data with a special focus on identifying ideas. We employ a research design where two human raters classified 3,000 texts extracted from an online community, according to whether the text contained an idea. Among the 3,000, 137 idea texts and 2,666 non-idea texts were identified. The human raters could not agree on the remaining 197 texts. These texts were omitted from the analysis. The remaining 2,803 texts were processed by using text mining techniques and used to train a classification model. We describe how to tune the model and which text mining steps to perform. We conclude that machine learning and text mining can be useful for detecting ideas in online communities. The method can help researchers and firms identify ideas hidden in large amounts of texts. Also, it is interesting in its own right that machine learning can be used to detect ideas.
引用
收藏
页码:17 / 30
页数:14
相关论文
共 58 条
[1]   A Systematic Comparison of Supervised Classifiers [J].
Amancio, Diego Raphael ;
Comin, Cesar Henrique ;
Casanova, Dalcimar ;
Travieso, Gonzalo ;
Bruno, Odemir Martinez ;
Rodrigues, Francisco Aparecido ;
Costa, Luciano da Fontoura .
PLOS ONE, 2014, 9 (04)
[2]  
[Anonymous], 2008, The Elements of Statistical Learning
[3]  
[Anonymous], 2005, DATA MINING
[4]  
[Anonymous], 2011, DATA MINING TECHNIQU
[5]  
[Anonymous], 2001, EFFECT CLASS DISTRIB
[6]  
[Anonymous], 2011, Pei. data mining concepts and techniques, DOI 10.1016/C2009-0-61819-5
[7]   Mapping the Topic Landscape of JPIM, 1984-2013: In Search of Hidden Structures and Development Trajectories [J].
Antons, David ;
Kleer, Robin ;
Salge, Torsten Oliver .
JOURNAL OF PRODUCT INNOVATION MANAGEMENT, 2016, 33 (06) :726-749
[8]  
Antorini Y.M., 2007, Brand community innovation: An intrinsic case study of the adult fans of LEGO community
[9]  
Antorini YM, 2012, MIT SLOAN MANAGE REV, V53, P73
[10]   Simultaneously Discovering and Quantifying Risk Types from Textual Risk Disclosures [J].
Bao, Yang ;
Datta, Anindya .
MANAGEMENT SCIENCE, 2014, 60 (06) :1371-1391