Emerging industry classification based on BERT model

被引:0
作者
Yang, Baocheng [1 ]
Zhang, Bing [1 ]
Cutsforth, Kevin [2 ]
Yu, Shanfu [1 ]
Yu, Xiaowen [3 ]
机构
[1] Huanghe Sci & Technol Univ, Zhengzhou, Peoples R China
[2] Royal Agr Univ, Cirencester, England
[3] Henan Finance Univ, Zhengzhou, Peoples R China
关键词
Industry classification; Machine learning; BERT;
D O I
10.1016/j.is.2024.102484
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurate industry classification is central to economic analysis and policy making. Current classification systems, while foundational, exhibit limitations in the face of the exponential growth of big data. These limitations include subjectivity, leading to inconsistencies and misclassifications. To overcome these shortcomings, this paper focuses on utilizing the BERT model for classifying emerging industries through the identification of salient attributes within business descriptions. The proposed method identifies clusters of firms within distinct industries, thereby transcending the restrictions inherent in existing classification systems. The model exhibits an impressive degree of precision in categorizing business descriptions, achieving accuracy rates spanning from 84.11% to 99.66% across all 16 industry classifications. This research enriches the field of industry classification literature through a practical examination of the efficacy of machine learning techniques. Our experiments achieved strong performance, highlighting the effectiveness of the BERT model in accurately classifying and identifying emerging industries, providing valuable insights for industry analysts and policymakers.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] A Smart Contract Vulnerability Detection System Based on BERT Model and Fuzz Testing
    Liang, Zhehao
    Cui, Baojiang
    Wang, Dongbin
    Xu, Jie
    Liu, Huipeng
    INNOVATIVE MOBILE AND INTERNET SERVICES IN UBIQUITOUS COMPUTING, IMIS 2024, 2024, 214 : 288 - 295
  • [42] Literature Classification Model of Deep Learning Based on BERT-BiLSTM-Taking COVID-19 as an Example
    Li, Zhi
    APPLIED INTELLIGENCE AND INFORMATICS, AII 2021, 2021, 1435 : 336 - 348
  • [43] NeuroPpred-SVM: A New Model for Predicting Neuropeptides Based on Embeddings of BERT
    Liu, Yufeng
    Wang, Shuyu
    Li, Xiang
    Liu, Yinbo
    Zhu, Xiaolei
    JOURNAL OF PROTEOME RESEARCH, 2023, 22 (03) : 718 - 728
  • [44] Sentiment classification of microblog: A framework based on BERT and CNN with attention mechanism
    Jia, Keliang
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 101
  • [45] Research on Chinese Keyword Recognition Based on BERT Binary Classification Algorithm
    Zhu, Chunling
    Wu, Di
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON MACHINE INTELLIGENCE AND DIGITAL APPLICATIONS, MIDA2024, 2024, : 689 - 695
  • [46] Understanding stance classification of BERT models: an attention-based framework
    Saenz, Carlos Abel Cordova
    Becker, Karin
    KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 66 (1) : 419 - 451
  • [47] SSMBERT: A Space Science Mission Requirement Classification Method Based on BERT
    Zhu, Yiming
    Zhang, Yuzhu
    Peng, Xiaodong
    Xue, Changbin
    Chen, Bin
    Cao, Yu
    AEROSPACE, 2024, 11 (12)
  • [48] Understanding stance classification of BERT models: an attention-based framework
    Carlos Abel Córdova Sáenz
    Karin Becker
    Knowledge and Information Systems, 2024, 66 : 419 - 451
  • [49] Multi-label Classification of Chinese Judicial Documents based on BERT
    Dai, Mian
    Liu, Chao-Lin
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 1866 - 1867
  • [50] A Sentence Classification Method for Chinese Spelling Error Detection Based on BERT
    Jiang, Jin
    Zhou, Yanquan
    2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2021, : 369 - 372