A Category Hybrid Embedding Based Approach for Power Text Hierarchical Classification

被引:0
|
作者
Chen X. [1 ]
Gao P. [1 ]
Liang Y. [1 ]
Ma Y. [1 ]
机构
[1] School of Control and Computer Engineering, North China Electric Power University, Beijing
来源
Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis | 2022年 / 58卷 / 01期
关键词
Category embedding; Hierarchical text classification; Power information technology; Power text classification;
D O I
10.13209/j.0479-8023.2021.104
中图分类号
学科分类号
摘要
Aiming at the problem that the current power text classification methods ignore the latent semantic association between category labels and therefore lead to low classification performance, a hierarchical multi-label power text classification method is proposed. Firstly, a power multi-label text dataset is built using automatic information extraction based on power unstructured texts, and the hierarchical structural relationships between categories are constructed by leveraging relevant domain knowledge. Secondly, a text classification method HONLSTM-BERT is proposed based on hybrid embeddings of category structure and label semantics for hierarchically classifying power texts in a top-down manner. At last, experiments were made in comparison with some popular text classification methods, and the experimental results show that proposed HONLSTM-BERT method achieves superior classification accuracy, and can efficiently improve the performance of automatic text classification. © 2022 Peking University.
引用
收藏
页码:77 / 82
页数:5
相关论文
共 14 条
  • [1] (2018)
  • [2] (2019)
  • [3] Cai Xinlei, Cui Yanlin, Dong Kai, Et al., Framework of smart regulation system for large-scale power grid based on big data and artificial intelligence, Procee-dings of International Conference on Machine Learn-ing and Big Data Analytics for IoT Security and Privacy, pp. 440-445, (2020)
  • [4] Melamud O, Goldberger J, Dagan I., Context2vec: learning generic context embedding with bidirectional LSTM, Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, pp. 51-61, (2016)
  • [5] Kowsari K, Brown D E, Heidarysafa M, Et al., HDL-Tex: hierarchical deep learning for text classifica- tion, Proceedings of International Conference on Machine Learning and Applications, pp. 364-371, (2017)
  • [6] Xi Ziyue, Chen Xiaona, Ahmad T, Et al., A novel ensemble approach to multi-label classification for electric power fault diagnosis, Proceedings of IEEE 7th International Conference on Computer Science and Network Technology, pp. 278-282, (2019)
  • [7] Peng Hao, Li Jianxin, He Yu, Et al., Large-scale hierarchical text classification with recursively regul-arized deep graph-CNN, Proceedings of the World Wide Web Conference, pp. 1063-1072, (2018)
  • [8] Wehrmann J, Cerri R, Barros R., Hierarchical multi-label classification networks, Proceedings of the 35th International Conference on Machine Learning, 12, pp. 8321-8330, (2018)
  • [9] Shen Yikang, Tan S, Sordoni A, Et al., Ordered neurons: integrating tree structures into recurrent neural networks, Proceedings of the 7th Internation-al Conference on Learning Representations (ICLR' 19), pp. 1-14, (2019)
  • [10] Chawla N V, Bowyer K W, Hall L O, Et al., SMOTE: synthetic minority over-sampling technique, Journal of Artificial Intelligence Research, 16, 1, pp. 321-357, (2002)