Towards Multilingual Automated Classification Systems

被引:2
|
作者
Musaev, Aibek [1 ]
Pu, Calton [2 ]
机构
[1] Univ Alabama, Dept Comp Sci, Tuscaloosa, AL 35487 USA
[2] Georgia Inst Technol, Sch Comp Sci, Atlanta, GA 30332 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/ICDCS.2017.208
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper we propose and evaluate three approaches for automated classification of texts in over 60 languages without the need for a manually annotated dataset in those languages. All approaches are based on the randomized Explicit Semantic Analysis method using multilingual Wikipedia articles as their knowledge repository. We evaluate the proposed approaches by classifying a Twitter dataset in English and Portuguese into relevant and irrelevant items with respect to landslide as a natural disaster, where the highest achieved F1-score is 0.93. These approaches can be used in various applications where multilingual classification is needed, including multilingual disaster reporting using Social Media to improve coverage and increase confidence. As illustration, we present a demonstration that combines data from physical sensors and social networks to detect landslide events reported in English and Portuguese.
引用
收藏
页码:2333 / 2337
页数:5
相关论文
共 50 条
  • [1] Towards an Automated Classification of Spreadsheets
    Mendes, Jorge
    Do, Kha N.
    Saraiva, Joao
    SOFTWARE TECHNOLOGIES: APPLICATIONS AND FOUNDATIONS (STAF 2016), 2016, 9946 : 346 - 355
  • [2] Towards an Online Multilingual Tool for Automated Conceptual Database Design
    Brdjanin, Drazen
    Grumic, Mladen
    Banjac, Goran
    Miscevic, Milan
    Dujlovic, Igor
    Kelec, Aleksandar
    Obradovic, Nikola
    Banjac, Danijela
    Volas, Dragana
    Maric, Slavko
    INTELLIGENT DISTRIBUTED COMPUTING XV, IDC 2022, 2023, 1089 : 144 - 153
  • [3] Modeling Classification Systems in Multicultural and Multilingual Contexts
    Mitchell, Joan
    Zeng, Marcia
    Zumer, Maja
    CATALOGING & CLASSIFICATION QUARTERLY, 2014, 52 (01) : 90 - 101
  • [4] Towards an Automated Classification of Software Libraries
    Auch M.
    Balluff M.
    Mandl P.
    Wolff C.
    SN Computer Science, 5 (4)
  • [5] Towards automated bone fracture classification
    Funk, MW
    El-Kwae, EA
    Kellam, JF
    MEDICAL IMAGING: 2001: IMAGE PROCESSING, PTS 1-3, 2001, 4322 : 755 - 765
  • [6] Towards the automated classification system of worn surfaces
    Wolski, M.
    Woloszynski, T.
    Stachowiak, G. W.
    Podsiadlo, P.
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART J-JOURNAL OF ENGINEERING TRIBOLOGY, 2020, 234 (08) : 1265 - 1274
  • [7] Towards a Common Classification of Changes for Information and Automated Production Systems as Precondition for Maintenance Effort Estimation
    Vogel-Heuser, Birgit
    Simon, Thomas
    Folmer, Jens
    Heinrich, Robert
    Rostami, Kiana
    Reussner, Ralf
    2016 IEEE 14TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2016, : 166 - 172
  • [8] Methods of training set construction: Towards improving performance for automated mesozooplankton image classification systems
    Chang, Chun-Yi
    Ho, Pei-Chi
    Sastri, Akash R.
    Lee, Yu-Ching
    Gong, Gwo-Ching
    Hsieh, Chih-hao
    CONTINENTAL SHELF RESEARCH, 2012, 36 : 19 - 28
  • [9] Towards automated deduction in cP systems
    Liu, Yezhou
    Nicolescu, Radu
    Sun, Jing
    INFORMATION SCIENCES, 2022, 587 : 435 - 449
  • [10] Towards Acceptance of Automated Driving Systems
    Jamson, Samantha
    Risvas, Konstantinos
    Naveiro, Roi
    Rios Insua, David
    Moustakas, Konstantinos
    Kruszewski, Mikolaj
    Rodak, Aleksandra
    Barisone, Alessandro
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON COMPUTER-HUMAN INTERACTION RESEARCH AND APPLICATIONS (CHIRA), 2021, : 232 - 239