Punjabi news multi-classification using language generation-based optimized long short-term memory networks

被引:0
作者
Gupta, Varun [1 ]
Gupta, Ekta [1 ]
机构
[1] Chandigarh Coll Engn & Technol, Dept Comp Sci & Engn, Chandigarh, India
关键词
Text-classification; News classification; Asian languages; Punjabi language; Deep neural networks; Recurrent neural networks; LSTM; Averaged SGD-Long short term memory networks;
D O I
10.1007/s12530-022-09428-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text classification is a method that assigns a specific category to each piece of written information. It is one of the fundamental tasks in natural language processing that has a wide range of applications like spam detection, sentiment analysis, etc. One type of text classification is news classification which can help the reader to focus on news as per their choice. In this paper, we propose a novel method for multiclassification of Punjabi news articles using a pretrained language generation model based optimized and regularized long short-term memory model. The proposed method employs Averaged Stochastic Gradient Descent Weight-Dropped LSTM model, which uses a recurrent regularization technique known as DropConnect on hidden-to-hidden weights and a variant of the averaged stochastic gradient method wherein the averaging trigger is determined using a non-monotonic condition instead of being tuned by the user. The proposed news classification method works in three stages. In the first stage, we train a language model on Punjabi text acquired from Wikipedia, and in the second stage, we fine-tune the language model on the Punjabi news dataset. Finally, we train a classifier using the pretrained encoder part of the language model. The pretrained encoder part of the language model helps the classifier in the linguistic understanding of the text, resulting in better classification results on the text. The results obtained from the proposed work indicate that the proposed method outperforms the other direct methods of news classification, which are not using pretrained language generation models.
引用
收藏
页码:49 / 58
页数:10
相关论文
共 50 条
  • [41] Long Short-Term Memory Networks Based Fall Detection Using Unified Pose Estimation
    Adhikari, Kripesh
    Bouchachia, Hamid
    Nait-Charif, Hammadi
    TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [42] Incorporating Pre-Training in Long Short-Term Memory Networks for Tweets Classification
    Yuan, Shuhan
    Wu, Xintao
    Xiang, Yang
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1329 - 1334
  • [43] A short-term prediction model of global ionospheric VTEC based on the combination of long short-term memory and convolutional long short-term memory
    Chen, Peng
    Wang, Rong
    Yao, Yibin
    Chen, Hao
    Wang, Zhihao
    An, Zhiyuan
    JOURNAL OF GEODESY, 2023, 97 (05)
  • [44] Short-Term Load Forecasting using A Long Short-Term Memory Network
    Liu, Chang
    Jin, Zhijian
    Gu, Jie
    Qiu, Caiming
    2017 IEEE PES INNOVATIVE SMART GRID TECHNOLOGIES CONFERENCE EUROPE (ISGT-EUROPE), 2017,
  • [45] Arabic Language Opinion Mining Based on Long Short-Term Memory (LSTM)
    Setyanto, Arief
    Laksito, Arif
    Alarfaj, Fawaz
    Alreshoodi, Mohammed
    Kusrini
    Oyong, Irwan
    Hayaty, Mardhiya
    Alomair, Abdullah
    Almusallam, Naif
    Kurniasari, Lilis
    APPLIED SCIENCES-BASEL, 2022, 12 (09):
  • [46] Aircraft Trajectory Prediction Using Deep Long Short-Term Memory Networks
    Zhao, Ziyu
    Zeng, Weili
    Quan, Zhibin
    Chen, Mengfei
    Yang, Zhao
    CICTP 2019: TRANSPORTATION IN CHINA-CONNECTING THE WORLD, 2019, : 124 - 135
  • [47] Dialog State Tracking Using Long Short-term Memory Neural Networks
    Yang, Xiaohao
    Liu, Jia
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1800 - 1804
  • [48] Tailings Pond Risk Prediction Using Long Short-Term Memory Networks
    Li, Jianwei
    Chen, Haoyu
    Zhou, Ting
    Li, Xiaowen
    IEEE ACCESS, 2019, 7 : 182527 - 182537
  • [49] Photovoltaic Farm Production Forecasting: Modified Metaheuristic Optimized Long Short-Term Memory-Based Networks Approach
    Stojkovic, Aleksandar
    Nikolic, Bosko
    Zivkovic, Miodrag
    Bacanin, Nebojsa
    IEEE ACCESS, 2025, 13 : 25198 - 25222
  • [50] Classification of Radicalism Content from Twitter Written in Indonesian Language using Long Short Term Memory
    Idris, Nur Oktavin
    Widyawan
    Adji, Teguh Bharata
    2019 3RD INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS 2019), 2019,