Multilingual Topic Classification in X: Dataset and Analysis

Authors
Antypas, Dimosthenis [1]
Ushio, Asahi [2]
Barbieri, Francesco [3]
Camacho-Collados, Jose [1]
Affiliations
[1] Cardiff NLP, Cardiff University, United Kingdom
[2] Amazon, Tokyo, Japan
[3] Snap Inc., Santa Monica, CA, United States
Source
EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference | 2024
Keywords
'current - Computational scientists - Linguistic analysis - Media content - Multilingual analysis - Online dialog - Social media - Topic Classification - Topic Modeling - Traditional techniques
DOI
Not available
Pages: 20136-20152
Related papers
50 in total
  • [1] HashCat: A Novel Approach for the Topic Classification of Multilingual Twitter Trends
    Kausar, Soufia
    Tahir, Bilal
    Mehmood, Muhammad Amir
    2021 INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY (FIT 2021), 2021, : 212 - 217
  • [2] Data generation approaches for topic classification in multilingual spoken dialog systems
    Montenegro, C.
    Santana, R.
    Lozano, J. A.
    12TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS (PETRA 2019), 2019, : 211 - 217
  • [3] Zika discourse in the Americas: A multilingual topic analysis of Twitter
    Pruss, Dasha
    Fujinuma, Yoshinari
    Daughton, Ashlynn R.
    Paul, Michael J.
    Arnot, Brad
    Szafir, Danielle Albers
    Boyd-Graber, Jordan
    PLOS ONE, 2019, 14 (05):
  • [4] X-FACT: A New Benchmark Dataset for Multilingual Fact Checking
    Gupta, Ashim
    Srikumar, Vivek
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 675 - 682
  • [5] The RELX Dataset and Matching the Multilingual Blanks for Cross-Lingual Relation Classification
    Koksal, Abdullatif
    Ozgur, Arzucan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020,
  • [6] Nonparametric Symmetric Correspondence Topic Models for Multilingual Text Analysis
    Cai, Rui
    Chen, Miaohong
    Wang, Houfeng
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2015, 2015, 9362 : 270 - 281
  • [7] Monolingual and multilingual topic analysis using LDA and BERT embeddings
    Xie, Qing
    Zhang, Xinyuan
    Ding, Ying
    Song, Min
    JOURNAL OF INFORMETRICS, 2020, 14 (03)
  • [8] Multilingual Image Corpus - Towards a Multimodal and Multilingual Dataset
    Koeva, Svetla
    Stoyanova, Ivelina
    Kralev, Jordan
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1509 - 1518
  • [9] Dataset Alignment and Lexicalization to Support Multilingual Analysis of Legal Documents
    Stellato A.
    Fiorelli M.
    Turbati A.
    Lorenzetti T.
    Schmitz P.
    Francesconi E.
    Hajlaoui N.
    Batouche B.
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2018, 10791 : 257 - 271
  • [10] Sentiment Analysis of Multilingual Dataset of Bahraini Dialects, Arabic, and English
    Omran, Thuraya
    Sharef, Baraa
    Grosan, Crina
    Li, Yongmin
    DATA, 2023, 8 (04)