Unified benchmark for zero-shot Turkish text classification

Cited by: 8
Authors
Celik, Emrecan [1]
Dalyan, Tugba [1]
Affiliations
[1] Istanbul Bilgi Univ, Dept Comp Engn, Eski Silahtaraga Elekt Santrali Kazim Karabekir Ca, TR-34060 Istanbul, Turkiye
Keywords
Text classification; Zero-shot learning; Next sentence prediction; Natural language inference; Masked language modeling; Dataset
DOI
10.1016/j.ipm.2023.103298
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Effective learning schemes such as fine-tuning, zero-shot, and few-shot learning have been widely used to obtain considerable performance with only a small amount of annotated training data. In this paper, we presented a unified benchmark for zero-shot text classification in Turkish. For this purpose, we evaluated three methods, namely Natural Language Inference (NLI), Next Sentence Prediction, and our proposed model based on Masked Language Modeling and pre-trained word embeddings, on nine Turkish datasets covering three main categories: topic, sentiment, and emotion. We used pre-trained Turkish monolingual and multilingual transformer models: BERT, ConvBERT, DistilBERT, and mBERT. The results showed that ConvBERT with the NLI method yields the best results with 79% and outperforms the previously used multilingual XLM-RoBERTa model by 19.6%. The study contributes to the literature by using transformer models not previously attempted for Turkish and by showing that monolingual models improve zero-shot text classification performance over multilingual models.
Pages: 14