Punjabi news multi-classification using language generation-based optimized long short-term memory networks

被引:0
作者
Gupta, Varun [1 ]
Gupta, Ekta [1 ]
机构
[1] Chandigarh Coll Engn & Technol, Dept Comp Sci & Engn, Chandigarh, India
关键词
Text-classification; News classification; Asian languages; Punjabi language; Deep neural networks; Recurrent neural networks; LSTM; Averaged SGD-Long short term memory networks;
D O I
10.1007/s12530-022-09428-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text classification is a method that assigns a specific category to each piece of written information. It is one of the fundamental tasks in natural language processing that has a wide range of applications like spam detection, sentiment analysis, etc. One type of text classification is news classification which can help the reader to focus on news as per their choice. In this paper, we propose a novel method for multiclassification of Punjabi news articles using a pretrained language generation model based optimized and regularized long short-term memory model. The proposed method employs Averaged Stochastic Gradient Descent Weight-Dropped LSTM model, which uses a recurrent regularization technique known as DropConnect on hidden-to-hidden weights and a variant of the averaged stochastic gradient method wherein the averaging trigger is determined using a non-monotonic condition instead of being tuned by the user. The proposed news classification method works in three stages. In the first stage, we train a language model on Punjabi text acquired from Wikipedia, and in the second stage, we fine-tune the language model on the Punjabi news dataset. Finally, we train a classifier using the pretrained encoder part of the language model. The pretrained encoder part of the language model helps the classifier in the linguistic understanding of the text, resulting in better classification results on the text. The results obtained from the proposed work indicate that the proposed method outperforms the other direct methods of news classification, which are not using pretrained language generation models.
引用
收藏
页码:49 / 58
页数:10
相关论文
共 50 条
  • [21] Human activity classification using long short-term memory network
    Welhenge, Anuradhi Malshika
    Taparugssanagorn, Attaphongse
    SIGNAL IMAGE AND VIDEO PROCESSING, 2019, 13 (04) : 651 - 656
  • [22] Research on Attention Classification Based on Long Short-term Memory Network
    Wang Pai
    Wu Fan
    Wang Mei
    Qin Xue-Bin
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1148 - 1151
  • [23] A text classification method based on a convolutional and bidirectional long short-term memory model
    Huan, Hai
    Guo, Zelin
    Cai, Tingting
    He, Zichen
    CONNECTION SCIENCE, 2022, 34 (01) : 2108 - 2124
  • [24] VOICE CONVERSION USING DEEP BIDIRECTIONAL LONG SHORT-TERM MEMORY BASED RECURRENT NEURAL NETWORKS
    Sun, Lifa
    Kang, Shiyin
    Li, Kun
    Meng, Helen
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4869 - 4873
  • [25] Chord-based music generation using long short-term memory neural networks in the context of artificial intelligence
    Li, Fanfan
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (05) : 6068 - 6092
  • [26] Using long short-term memory networks for river flow prediction
    Xu, Wei
    Jiang, Yanan
    Zhang, Xiaoli
    Li, Yi
    Zhang, Run
    Fu, Guangtao
    HYDROLOGY RESEARCH, 2020, 51 (06): : 1358 - 1376
  • [27] Subclinical tremor differentiation using long short-term memory networks
    Nanayakkara, Gerard Ruchin Randil
    Chan, Ping Yi
    PHYSICAL AND ENGINEERING SCIENCES IN MEDICINE, 2025,
  • [28] DC Pulsed Load Transient Classification Using Long Short-Term Memory Recurrent Neural Networks
    Oslebo, Damian
    Corzine, Keith
    Weatherford, Todd
    Maqsood, Atif
    Norton, Matthew
    2019 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2019,
  • [29] Long Short-Term Memory for Bed Position Classification
    Sao, Sakada
    Sornlertlamvanich, Virach
    PROCEEDINGS OF THE 2019 4TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY (INCIT): ENCOMPASSING INTELLIGENT TECHNOLOGY AND INNOVATION TOWARDS THE NEW ERA OF HUMAN LIFE, 2019, : 28 - 31
  • [30] Long Short-term Memory based on a Reward/punishment Strategy for Recurrent Neural Networks
    Liu, Jiangjiang
    Luo, Biao
    Yan, Pengfei
    Wang, Ding
    Liu, Derong
    2017 32ND YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2017, : 327 - 332