Punjabi news multi-classification using language generation-based optimized long short-term memory networks

被引:0
|
作者
Gupta, Varun [1 ]
Gupta, Ekta [1 ]
机构
[1] Chandigarh Coll Engn & Technol, Dept Comp Sci & Engn, Chandigarh, India
关键词
Text-classification; News classification; Asian languages; Punjabi language; Deep neural networks; Recurrent neural networks; LSTM; Averaged SGD-Long short term memory networks;
D O I
10.1007/s12530-022-09428-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text classification is a method that assigns a specific category to each piece of written information. It is one of the fundamental tasks in natural language processing that has a wide range of applications like spam detection, sentiment analysis, etc. One type of text classification is news classification which can help the reader to focus on news as per their choice. In this paper, we propose a novel method for multiclassification of Punjabi news articles using a pretrained language generation model based optimized and regularized long short-term memory model. The proposed method employs Averaged Stochastic Gradient Descent Weight-Dropped LSTM model, which uses a recurrent regularization technique known as DropConnect on hidden-to-hidden weights and a variant of the averaged stochastic gradient method wherein the averaging trigger is determined using a non-monotonic condition instead of being tuned by the user. The proposed news classification method works in three stages. In the first stage, we train a language model on Punjabi text acquired from Wikipedia, and in the second stage, we fine-tune the language model on the Punjabi news dataset. Finally, we train a classifier using the pretrained encoder part of the language model. The pretrained encoder part of the language model helps the classifier in the linguistic understanding of the text, resulting in better classification results on the text. The results obtained from the proposed work indicate that the proposed method outperforms the other direct methods of news classification, which are not using pretrained language generation models.
引用
收藏
页码:49 / 58
页数:10
相关论文
共 50 条
  • [1] Punjabi news multi-classification using language generation-based optimized long short-term memory networks
    Varun Gupta
    Ekta Gupta
    Evolving Systems, 2023, 14 : 49 - 58
  • [2] Classification of Antibacterial Peptides Using Long Short-Term Memory Recurrent Neural Networks
    Youmans, Michael
    Spainhour, John C. G.
    Qiu, Peng
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (04) : 1134 - 1140
  • [3] MULTI-TEMPORAL LAND COVER CLASSIFICATION WITH LONG SHORT-TERM MEMORY NEURAL NETWORKS
    Russwurm, M.
    Koermer, M.
    ISPRS HANNOVER WORKSHOP: HRIGI 17 - CMRT 17 - ISA 17 - EUROCOW 17, 2017, 42-1 (W1): : 551 - 558
  • [4] SPOKEN LANGUAGE UNDERSTANDING USING LONG SHORT-TERM MEMORY NEURAL NETWORKS
    Yao, Kaisheng
    Peng, Baolin
    Zhang, Yu
    Yu, Dong
    Zweig, Geoffrey
    Shi, Yangyang
    2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 189 - 194
  • [5] Deterministic convergence analysis for regularized long short-term memory and its application to regression and multi-classification problems
    Kang, Qian
    Yu, Dengxiu
    Cheong, Kang Hao
    Wang, Zhen
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [6] Surveillance videos classification based on multilayer long short-term memory networks
    Hong Zhang
    Liang Zhao
    Gang Dai
    Multimedia Tools and Applications, 2020, 79 : 12125 - 12137
  • [7] Text Classification Using Long Short-Term Memory
    Sari, Winda Kurnia
    Rini, Dian Palupi
    Malik, Reza Firsandaya
    2019 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND COMPUTER SCIENCE (ICECOS 2019), 2019, : 150 - 155
  • [8] Surveillance videos classification based on multilayer long short-term memory networks
    Zhang, Hong
    Zhao, Liang
    Dai, Gang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (17-18) : 12125 - 12137
  • [9] Language Modeling Using Part-of-speech and Long Short-Term Memory Networks
    Norouzi, Sanaz Saki
    Akbari, Ahmad
    Nasersharif, Babak
    2019 9TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE 2019), 2019, : 182 - 187
  • [10] NEWLSTM: An Optimized Long Short-Term Memory Language Model for Sequence Prediction
    Wang, Qing
    Peng, Rong-Qun
    Wang, Jia-Qiang
    Li, Zhi
    Qu, Han-Bing
    IEEE ACCESS, 2020, 8 : 65395 - 65401