Automated Industry Classification with Deep Learning

被引:0
作者
Wood, Sam [1 ]
Muthyala, Rohit [2 ]
Jin, Yi [3 ]
Qin, Yixing [3 ]
Rukadikar, Nilaj [3 ]
Gao, Hua [3 ]
Rai, Amit [3 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] NYU, New York, NY USA
[3] EverString Technol, San Mateo, CA USA
来源
2018 IEEE 12TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC) | 2018年
关键词
Big Data; Neural Network; Industry Classification; Multilayer Perceptron; Predictive Marketing; NAICS; Deep Learning;
D O I
10.1109/ICSC.2018.00018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a novel technique for industry classification. Leveraging EverString's API to construct a database of companies labeled with the industries to which they belong, we train a deep neural network to predict the industries of novel companies. We examine the capacity of our model to predict six-digit NAICS codes, as well as the ability of our model architecture to adapt to other industry segmentation schemas. Additionally, we investigate the ability of our model to generalize despite the presence of noise in the labels in our training set. Finally, we explore the possibility of increasing predictive precision by thresholding based on the confidence scores that our model outputs along with its predictions. We find that our approach yields six-digit NAICS code predictions that surpass the precision of gold-standard databases.
引用
收藏
页码:64 / 70
页数:7
相关论文
共 11 条
[1]  
[Anonymous], 2015, NATURE, DOI [10.1038/nature14539, 10.1038/, DOI 10.1038/NATURE14539]
[2]  
[Anonymous], 2016, INT C LEARN REPR
[3]  
Filipe Rodrigues Francisco C Pereira., 2012, Proceedings of the 4th International Conference on Advanced Geographic Information Systems, Applications, and Services, P41
[4]  
Heiden E, 2017, P 3 INT WORKSH DAT S
[5]   Mining point-of-interest data from social networks for urban land use classification and disaggregation [J].
Jiang, Shan ;
Alves, Ana ;
Rodrigues, Filipe ;
Ferreira, Joseph, Jr. ;
Pereira, Francisco C. .
COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2015, 53 :36-46
[6]  
N. Association, NAICS ASS FAQ
[7]  
Natarajan N., 2013, Advances in neural information processing systems, P1196, DOI DOI 10.5555/2999611.2999745
[8]  
Pierre J, 2001, LINKOPING ELECT ARTI
[9]   Deep learning in neural networks: An overview [J].
Schmidhuber, Juergen .
NEURAL NETWORKS, 2015, 61 :85-117
[10]  
U. C. Bureau, N AM IND CLASS SYST