Towards Multilingual Automated Classification Systems

被引:2
|
作者
Musaev, Aibek [1 ]
Pu, Calton [2 ]
机构
[1] Univ Alabama, Dept Comp Sci, Tuscaloosa, AL 35487 USA
[2] Georgia Inst Technol, Sch Comp Sci, Atlanta, GA 30332 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/ICDCS.2017.208
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper we propose and evaluate three approaches for automated classification of texts in over 60 languages without the need for a manually annotated dataset in those languages. All approaches are based on the randomized Explicit Semantic Analysis method using multilingual Wikipedia articles as their knowledge repository. We evaluate the proposed approaches by classifying a Twitter dataset in English and Portuguese into relevant and irrelevant items with respect to landslide as a natural disaster, where the highest achieved F1-score is 0.93. These approaches can be used in various applications where multilingual classification is needed, including multilingual disaster reporting using Social Media to improve coverage and increase confidence. As illustration, we present a demonstration that combines data from physical sensors and social networks to detect landslide events reported in English and Portuguese.
引用
收藏
页码:2333 / 2337
页数:5
相关论文
共 50 条
  • [11] Towards Automated Embedded Systems Programming
    Yusuf, Imam Nur Bani
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS, ICSE-COMPANION, 2023, : 224 - 226
  • [12] Towards Automated Robotic Nanomanipulation Systems
    Jasper, Daniel
    Edeler, Christoph
    Diederichs, Claas
    Naroska, Mirko
    Stolle, Christian
    Fatikow, Sergej
    2009 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS, VOLS 1-3, 2009, : 94 - 99
  • [13] Towards Multilingual LLM-Based Approaches for Automatic Dewey Decimal Classification
    Ho, Clara Wan Ching
    Weber, Tobias
    Fritze, Thorsten
    Risse, Thomas
    LINKING THEORY AND PRACTICE OF DIGITAL LIBRARIES, PT II, TPDL 2024, 2024, 15178 : 23 - 33
  • [14] Towards rapid and automated vulnerability classification of concrete buildings
    Iturburu, Lissette
    Kwannandar, Jean
    Dyke, Shirley J.
    Liu, Xiaoyu
    Zhang, Xin
    Ramirez, Julio
    EARTHQUAKE ENGINEERING AND ENGINEERING VIBRATION, 2023, 22 (02) : 309 - 332
  • [15] Towards the development of an automated wear particle classification system
    Stachowiak, G. W.
    Podsiadlo, P.
    TRIBOLOGY INTERNATIONAL, 2006, 39 (12) : 1615 - 1623
  • [16] Towards rapid and automated vulnerability classification of concrete buildings
    Lissette Iturburu
    Jean Kwannandar
    Shirley J. Dyke
    Xiaoyu Liu
    Xin Zhang
    Julio Ramirez
    Earthquake Engineering and Engineering Vibration, 2023, 22 : 309 - 332
  • [17] Towards rapid and automated vulnerability classification of concrete buildings
    Lissette Iturburu
    Jean Kwannandar
    Shirley J.Dyke
    Xiaoyu Liu
    Xin Zhang
    Julio Ramirez
    Earthquake Engineering and Engineering Vibration, 2023, 22 (02) : 309 - 332
  • [18] Towards Automated Classification of Seabed Substrates in Underwater Video
    Pugh, Matthew
    Tiddeman, Bernard
    Dee, Hannah
    Hughes, Philip
    2014 ICPR WORKSHOP ON COMPUTER VISION FOR ANALYSIS OF UNDERWATER IMAGERY (CVAUI 2014), 2014, : 9 - 16
  • [19] Towards Establishing Systematic Classification Requirements for Automated Driving
    Mori, Ken T.
    Brown, Trent
    Peters, Steven
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [20] Towards automated classification of intensive care nursing narratives
    Hiissa, Marketta
    Pahikkala, Tapio
    Suominen, Hanna
    Lehtikunnas, Tuija
    Back, Barbro
    Karsten, Helena
    Salantera, Sanna
    Salakoski, Tapio
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2007, 76 (SUPPL. 3) : S362 - S368