Towards Multilingual Automated Classification Systems

Cited by: 2
Authors
Musaev, Aibek [1 ]
Pu, Calton [2 ]
Affiliations
[1] Univ Alabama, Dept Comp Sci, Tuscaloosa, AL 35487 USA
[2] Georgia Inst Technol, Sch Comp Sci, Atlanta, GA 30332 USA
Funding
National Science Foundation (USA)
DOI
10.1109/ICDCS.2017.208
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
In this paper, we propose and evaluate three approaches for the automated classification of texts in over 60 languages without the need for manually annotated datasets in those languages. All three approaches are based on the randomized Explicit Semantic Analysis method and use multilingual Wikipedia articles as their knowledge repository. We evaluate them by classifying a Twitter dataset in English and Portuguese into items that are relevant or irrelevant to landslides as a natural disaster; the highest F1-score achieved is 0.93. These approaches can be used in applications that require multilingual classification, including multilingual disaster reporting based on social media, where they can improve coverage and increase confidence. As an illustration, we present a demonstration that combines data from physical sensors and social networks to detect landslide events reported in English and Portuguese.
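The general idea behind Explicit Semantic Analysis can be sketched as follows: a text is represented by its similarity to a set of concept articles, and the strongest concept decides the label. This is only a toy illustration under stated assumptions, not the randomized method evaluated in the paper. The concept texts, the `CONCEPTS` table, and the function names are all hypothetical, and the tiny hand-written word lists stand in for real Wikipedia articles.

```python
import math
from collections import Counter

# Hypothetical miniature "concept articles" per language (assumption:
# a real system would use full Wikipedia articles instead).
CONCEPTS = {
    "en": {
        "Landslide": "landslide slope soil rock debris mudslide rain collapse hill",
        "Election": "election vote candidate win victory party poll campaign",
    },
    "pt": {
        "Landslide": "deslizamento terra encosta chuva lama solo rocha",
        "Election": "eleição voto candidato partido campanha vitória",
    },
}

PUNCT = ".,!?#@"


def tokenize(text):
    """Lowercase whitespace tokenizer that trims simple punctuation."""
    tokens = (w.strip(PUNCT).lower() for w in text.split())
    return [t for t in tokens if t]


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def esa_vector(text, lang="en"):
    """Map a text to its similarity with each concept article."""
    text_tf = Counter(tokenize(text))
    return {
        name: cosine(text_tf, Counter(tokenize(article)))
        for name, article in CONCEPTS[lang].items()
    }


def is_relevant(text, lang="en", concept="Landslide"):
    """Relevant if the target concept is the strongest non-zero match."""
    vec = esa_vector(text, lang)
    return vec[concept] > 0 and max(vec, key=vec.get) == concept


if __name__ == "__main__":
    print(is_relevant("Heavy rain caused a landslide on the hill"))
    print(is_relevant("Candidate wins the election by a landslide"))
    print(is_relevant("Chuva forte causa deslizamento de terra", lang="pt"))
```

Because the concept articles are looked up per language, the same scoring code handles English and Portuguese input without any labeled training data in either language, which mirrors the motivation stated in the abstract.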
Pages: 2333-2337
Page count: 5