Bangla News Classification using Graph Convolutional Networks

被引:0
作者
Rahman, Md Mahbubur [1 ]
Khan, Md Akib Zabed [2 ]
Biswas, Al Amin [3 ]
机构
[1] Crowd Realty, Tokyo, Japan
[2] Bangladesh Univ Business & Technol, Dept CSE, Dhaka, Bangladesh
[3] Daffodil Int Univ, Dept CSE, Dhaka, Bangladesh
来源
2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI) | 2021年
关键词
Bangla News; Graph Convolutional Networks; Document Classification; NLP;
D O I
10.1109/ICCC150826.2021.9402567
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Online Bangla news has rapidly increased in the era of the information age. Each news site has its different categorization for grouping their news. The Layout and categorization of the online Bangla news articles cannot perpetually meet the individual's needs due to the heterogeneity. So, overcoming this issue and classifying the online Bangla news articles according to the preference of the user is an arduous task. So, it is essential to provide state-of-the-art solutions as well as the best way to solve this problem. The paper aims to build an automated system to classify the Bangla news contents and also find out the state-of-the-art solutions for the small size dataset. It is known that most of the machine learning models need huge amounts of data for the proper training and testing of the models. But due to the scarcity of the dataset, it is not always possible to provide the state-of-the-art solutions. But, in this research work, we have found that Text-GCN performed better than the BiLSTM, GRU-LSTM, LSTM, Char-CNN, and BERT to classify the online Bangla news in spite of the small size of the dataset. The obtained experimental result shows the efficiency of the Text-GCN over the other models in terms of accuracy, precision recall, and F1-score.
引用
收藏
页数:5
相关论文
共 13 条
  • [1] Chy Abu Nowshed, 2014, 2013 16th International Conference on Computer and Information Technology (ICCIT), P366, DOI 10.1109/ICCITechn.2014.6997369
  • [2] Conneau A., 2016, ARXIV PREPRINT ARXIV
  • [3] Classification of news-related tweets
    Demirsoz, Orhan
    Ozcan, Rifat
    [J]. JOURNAL OF INFORMATION SCIENCE, 2017, 43 (04) : 509 - 524
  • [4] Dhar A., 2018, 2018 3 INT C INT THI, P1, DOI [10.1109/IoT-SIU.2018.8519866, DOI 10.1109/IOT-SIU.2018.8519866]
  • [5] Ghosal D., 2019, ARXIV PREPRINT ARXIV
  • [6] Hossain M. R., INT J COMPUTER APPL, V975, P8887
  • [7] Huang C. H., 2020, POLYM REV, P1, DOI DOI 10.1080/15583724.2019.1688830
  • [8] Islam M., 2017, ARXIV PREPRINT ARXIV
  • [9] Mandal A. K., 2014, INT J ARTIFICIAL INT, V5
  • [10] Rahman M.M., 2020 4 INT S MULT ST, P1, DOI [10.1109/ISMSIT50672.2020.9254416, DOI 10.1109/ISMSIT50672.2020.9254416]