Punjabi news multi-classification using language generation-based optimized long short-term memory networks

Cited by: 0

Authors:

Gupta, Varun [1]

Gupta, Ekta [1]

Affiliation:

[1] Chandigarh Coll Engn & Technol, Dept Comp Sci & Engn, Chandigarh, India

Source:

EVOLVING SYSTEMS | 2023 / Vol. 14 / Issue 01

Keywords:

Text-classification; News classification; Asian languages; Punjabi language; Deep neural networks; Recurrent neural networks; LSTM; Averaged SGD-Long short term memory networks;

DOI:

10.1007/s12530-022-09428-2

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Text classification assigns a category to each piece of written text. It is one of the fundamental tasks in natural language processing, with a wide range of applications such as spam detection and sentiment analysis. News classification is one type of text classification; it helps readers focus on the news they choose. In this paper, we propose a novel method for multi-classification of Punjabi news articles using an optimized and regularized long short-term memory model based on a pretrained language generation model. The proposed method employs the Averaged Stochastic Gradient Descent Weight-Dropped LSTM (AWD-LSTM) model, which applies a recurrent regularization technique known as DropConnect to the hidden-to-hidden weights and uses a variant of the averaged stochastic gradient method in which the averaging trigger is determined by a non-monotonic condition instead of being tuned by the user. The proposed news classification method works in three stages. In the first stage, we train a language model on Punjabi text acquired from Wikipedia. In the second stage, we fine-tune the language model on the Punjabi news dataset. Finally, we train a classifier using the pretrained encoder of the language model. The pretrained encoder gives the classifier a linguistic understanding of the text, resulting in better classification results. The results indicate that the proposed method outperforms direct news classification methods that do not use pretrained language generation models.
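The non-monotonic averaging trigger mentioned in the abstract can be illustrated with a minimal sketch. It assumes the trigger fires when the current validation loss is worse than the best loss recorded more than `patience` checks ago. The function names, the `patience` parameter, and the exact window indexing are hypothetical choices for illustration and are not taken from the paper.

```python
from typing import List, Optional


def nonmonotonic_trigger(history: List[float], current: float, patience: int) -> bool:
    """Return True when weight averaging should start.

    Assumption: averaging starts once the current validation loss exceeds
    the minimum of all earlier losses except the most recent `patience` ones.
    """
    if patience < 1 or len(history) <= patience:
        # Not enough past checks to apply the condition yet.
        return False
    return current > min(history[:-patience])


def find_trigger_step(val_losses: List[float], patience: int) -> Optional[int]:
    """Scan a sequence of validation losses and return the index of the
    check at which averaging would begin, or None if it never triggers."""
    history: List[float] = []
    for step, loss in enumerate(val_losses):
        if nonmonotonic_trigger(history, loss, patience):
            return step
        history.append(loss)
    return None


if __name__ == "__main__":
    # Hypothetical validation losses: improvement stalls after the third check.
    print(find_trigger_step([5.0, 4.0, 3.5, 3.6, 3.7, 3.8], patience=2))
```

In this sketch the user sets only the window size, not the iteration at which averaging begins, which is the property the abstract describes.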


Pages: 49-58

Number of pages: 10
