Unified benchmark for zero-shot Turkish text classification

被引:7
作者
celik, Emrecan [1 ]
Dalyan, Tugba [1 ]
机构
[1] Istanbul Bilgi Univ, Dept Comp Engn, Eski Silahtaraga Elekt Santrali Kazim Karabekir Ca, TR-34060 Istanbul, Turkiye
关键词
Text classification; Zero-shot learning; Next sentence prediction; Natural language inference; Masked language modeling; DATASET;
D O I
10.1016/j.ipm.2023.103298
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Effective learning schemes such as fine-tuning, zero-shot, and few-shot learning, have been widely used to obtain considerable performance with only a handful of annotated training data. In this paper, we presented a unified benchmark to facilitate the problem of zeroshot text classification in Turkish. For this purpose, we evaluated three methods, namely, Natural Language Inference, Next Sentence Prediction and our proposed model that is based on Masked Language Modeling and pre-trained word embeddings on nine Turkish datasets for three main categories: topic, sentiment, and emotion. We used pre-trained Turkish monolingual and multilingual transformer models which can be listed as BERT, ConvBERT, DistilBERT and mBERT. The results showed that ConvBERT with the NLI method yields the best results with 79% and outperforms previously used multilingual XLM-RoBERTa model by 19.6%. The study contributes to the literature using different and unattempted transformer models for Turkish and showing improvement of zero-shot text classification performance for monolingual models over multilingual models.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Zero-shot Relation Classification from Side Information
    Gong, Jiaying
    Eldardiry, Hoda
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 576 - 585
  • [22] Learning to Model Relationships for Zero-Shot Video Classification
    Gao, Junyu
    Zhang, Tianzhu
    Xu, Changsheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3476 - 3491
  • [23] Zero-Shot Audio Classification using Image Embeddings
    Dogan, Duygu
    Xie, Huang
    Heittola, Toni
    Virtanen, Tuomas
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1 - 5
  • [24] Enhanced VAEGAN: a zero-shot image classification method
    Bo Ding
    Yufei Fan
    Yongjun He
    Jing Zhao
    Applied Intelligence, 2023, 53 : 9235 - 9246
  • [25] Dual Projective Zero-Shot Learning Using Text Descriptions
    Rao, Yunbo
    Yang, Ziqiang
    Zeng, Shaoning
    Wang, Qifeng
    Pu, Jiansu
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [26] Zero-Shot Classification of Art With Large Language Models
    Tojima, Tatsuya
    Yoshida, Mitsuo
    IEEE ACCESS, 2025, 13 : 17426 - 17439
  • [27] Zero-shot learning for requirements classification: An exploratory study
    Alhoshan, Waad
    Ferrari, Alessio
    Zhao, Liping
    INFORMATION AND SOFTWARE TECHNOLOGY, 2023, 159
  • [28] Review of Zero-Shot Remote Sensing Image Scene Classification
    Tan, Xiaomeng
    Xi, Bobo
    Li, Jiaojiao
    Zheng, Tie
    Li, Yunsong
    Xue, Changbin
    Chanussot, Jocelyn
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 11274 - 11289
  • [29] Zero-Shot Image Classification Method Based on Attribute Weighting
    Chen, Wenbai
    Chen, Xiangfeng
    Liu, Chang
    Wu, Hao
    Li, Denghua
    PROCEEDINGS OF 2019 6TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2019, : 84 - 88
  • [30] Zero-Shot Image Classification Based on Deep Feature Extraction
    Wang, Xuesong
    Chen, Chen
    Cheng, Yuhu
    Wang, Z. Jane
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2018, 10 (02) : 432 - 444