Emerging industry classification based on BERT model

Cited by: 0
Authors
Yang, Baocheng [1]
Zhang, Bing [1]
Cutsforth, Kevin [2]
Yu, Shanfu [1]
Yu, Xiaowen [3]
Affiliations
[1] Huanghe Sci & Technol Univ, Zhengzhou, Peoples R China
[2] Royal Agr Univ, Cirencester, England
[3] Henan Finance Univ, Zhengzhou, Peoples R China
Keywords
Industry classification; Machine learning; BERT;
DOI
10.1016/j.is.2024.102484
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Accurate industry classification is central to economic analysis and policy making. Current classification systems, while foundational, exhibit limitations in the face of the exponential growth of big data. These limitations include subjectivity, leading to inconsistencies and misclassifications. To overcome these shortcomings, this paper focuses on utilizing the BERT model for classifying emerging industries through the identification of salient attributes within business descriptions. The proposed method identifies clusters of firms within distinct industries, thereby transcending the restrictions inherent in existing classification systems. The model exhibits an impressive degree of precision in categorizing business descriptions, achieving accuracy rates spanning from 84.11% to 99.66% across all 16 industry classifications. This research enriches the field of industry classification literature through a practical examination of the efficacy of machine learning techniques. Our experiments achieved strong performance, highlighting the effectiveness of the BERT model in accurately classifying and identifying emerging industries, providing valuable insights for industry analysts and policymakers.
Pages: 7
Related papers
50 in total
  • [21] BERT-5mC: an interpretable model for predicting 5-methylcytosine sites of DNA based on BERT
    Wang, Shuyu
    Liu, Yinbo
    Liu, Yufeng
    Zhang, Yong
    Zhu, Xiaolei
    PEERJ, 2023, 11
  • [22] Chinese Text Classification Method Based on BERT Word Embedding
    Wang, Ziniu
    Huang, Zhilin
    Gao, Jianling
    2020 5TH INTERNATIONAL CONFERENCE ON MATHEMATICS AND ARTIFICIAL INTELLIGENCE (ICMAI 2020), 2020 : 66 - 71
  • [23] Disaster related tweet classification method based on BERT and GAT
    Nayan Ranjan Paul
    Rakesh Chandra Balabantaray
    International Journal of Information Technology, 2025, 17 (4) : 1987 - 1999
  • [24] The Automatic Text Classification Method Based on BERT and Feature Union
    Li, Wenting
    Gao, Shangbing
    Zhou, Hong
    Huang, Zihe
    Zhang, Kewen
    Li, Wei
    2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019 : 774 - 777
  • [25] Proposing sentiment analysis model based on BERT and XLNet for movie reviews
    Danyal, Mian Muhammad
    Khan, Sarwar Shah
    Khan, Muzammil
    Ullah, Subhan
    Mehmood, Faheem
    Ali, Ijaz
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (24) : 64315 - 64339
  • [26] Methods to Enhance BERT in Aspect-Based Sentiment Classification
    Zhao, Yufeng
    Soerjodjojo, Evelyn
    Che, Haiying
    2022 EURO-ASIA CONFERENCE ON FRONTIERS OF COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, FCSIT, 2022 : 21 - 27
  • [27] Research on sentiment classification of futures predictive texts based on BERT
    Weng Xiaofeng
    Zhao Jinghua
    Jiang Chenxi
    Ji Yiying
    COMPUTING, 2024, 106 (12) : 4231 - 4248
  • [28] Emotion Classification of Text Based on BERT and Broad Learning System
    Peng, Sancheng
    Zeng, Rong
    Liu, Hongzhan
    Chen, Guanghao
    Wu, Ruihuan
    Yang, Aimin
    Yu, Shui
    WEB AND BIG DATA, APWEB-WAIM 2021, PT I, 2021, 12858 : 382 - 396
  • [29] A BERT-based Idiom Detection Model
    Gamage, Gihan
    De Silva, Daswin
    Adikari, Achini
    Alahakoon, Damminda
    2022 15TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTION (HSI), 2022,
  • [30] Chinese Triple Extraction Based on BERT Model
    Deng, Weidong
    Liu, Yun
    PROCEEDINGS OF THE 2021 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2021), 2021,