Punjabi news multi-classification using language generation-based optimized long short-term memory networks

Cited by: 0
Authors
Gupta, Varun [1]
Gupta, Ekta [1]
Affiliation
[1] Chandigarh Coll Engn & Technol, Dept Comp Sci & Engn, Chandigarh, India
Keywords
Text classification; News classification; Asian languages; Punjabi language; Deep neural networks; Recurrent neural networks; LSTM; Averaged SGD-Long short term memory networks
DOI
10.1007/s12530-022-09428-2
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Text classification assigns a category to each piece of written text. It is a fundamental task in natural language processing, with applications such as spam detection and sentiment analysis. News classification is one type of text classification; it helps readers focus on the news that interests them. In this paper, we propose a novel method for multi-class classification of Punjabi news articles, based on a pretrained language generation model built on an optimized and regularized long short-term memory (LSTM) network. The method employs the Averaged Stochastic Gradient Descent Weight-Dropped LSTM (AWD-LSTM) model, which applies DropConnect, a recurrent regularization technique, to the hidden-to-hidden weights and uses a variant of averaged stochastic gradient descent in which the averaging trigger is determined by a non-monotonic condition rather than tuned by the user. The method works in three stages. In the first stage, we train a language model on Punjabi text acquired from Wikipedia. In the second stage, we fine-tune the language model on the Punjabi news dataset. In the third stage, we train a classifier on top of the pretrained encoder of the language model. The pretrained encoder gives the classifier a linguistic understanding of the text, which improves classification results. The results indicate that the proposed method outperforms direct news classification methods that do not use pretrained language generation models.
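The two training ingredients named in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. The function names (`drop_connect`, `should_start_averaging`, `find_averaging_trigger`), the window size `n`, and the loss values are hypothetical. The rescaling of kept weights by 1/(1 - p) is an assumption borrowed from standard inverted dropout, and the trigger rule follows the commonly described form of the non-monotonic condition: start averaging once the current validation loss is worse than the best loss recorded more than `n` checks ago.

```python
import random


def drop_connect(weights, p, rng):
    """Zero each hidden-to-hidden weight with probability p (0 <= p < 1).

    Kept weights are rescaled by 1 / (1 - p); this inverted-dropout
    scaling is an assumption of the sketch.
    """
    keep = 1.0 - p
    return [[w / keep if rng.random() >= p else 0.0 for w in row]
            for row in weights]


def should_start_averaging(val_history, current_val, n=5):
    """Non-monotonic trigger: fire when the current validation loss is
    worse than the best loss seen more than n checks ago."""
    if len(val_history) <= n:
        return False
    return current_val > min(val_history[:len(val_history) - n])


def find_averaging_trigger(val_losses, n=5):
    """Return the index of the first validation check at which averaging
    would start, or None if the trigger never fires."""
    history = []
    for t, loss in enumerate(val_losses):
        if should_start_averaging(history, loss, n):
            return t
        history.append(loss)
    return None


if __name__ == "__main__":
    rng = random.Random(0)
    hidden_to_hidden = [[0.5, -0.25], [1.0, 2.0]]  # made-up weights
    print(drop_connect(hidden_to_hidden, p=0.5, rng=rng))

    made_up_losses = [5.0, 4.0, 3.0, 3.1, 3.2, 3.05]
    print(find_averaging_trigger(made_up_losses, n=2))
```

With this rule the user chooses only the window `n` rather than a fixed iteration at which to switch to averaging. In a full training loop the mask would be resampled before each forward pass and applied only to the recurrent weight matrices.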
Pages: 49-58
Page count: 10