Text classification for DAG-structured categories

被引:0
|
作者
Nguyen, CD [1 ]
Dung, TA [1 ]
Cao, TH [1 ]
机构
[1] Ho Chi Minh City Univ Technol, Fac Informat Technol, Ho Chi Minh City, Vietnam
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS | 2005年 / 3518卷
关键词
text classification; hierarchies; multi-labels; SVM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical text classification concerning the relationship among categories has become an interesting problem recently. Most research has focused on tree-structured categories, but in reality directed acyclic graph (DAG) - structured categories, where a child category may have more than one parent category, appear more often. In this paper, we introduce three approaches, namely, flat, tree-based, and DAG-based, for solving the multi-label text classification problem in which categories are organized as a DAG, and documents are classified into both leaf and internal categories. We also present experimental results of the methods using SVMs as classifiers on the Reuters-21578 collection and our data set of research papers in Artificial Intelligence.
引用
收藏
页码:290 / 300
页数:11
相关论文
共 50 条
  • [41] The impact of preprocessing on text classification
    Uysal, Alper Kursat
    Gunal, Serkan
    INFORMATION PROCESSING & MANAGEMENT, 2014, 50 (01) : 104 - 112
  • [42] Feature Selection in Text Classification
    Sahin, Durmus Ozkan
    Ates, Nurullah
    Kilic, Erdal
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1777 - 1780
  • [43] Uncertainty Quantification for Text Classification
    Zhang, Dell
    Sensoy, Murat
    Makrehchi, Masoud
    Taneva-Popova, Bilyana
    Gui, Lin
    He, Yulan
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3426 - 3429
  • [44] Uncertainty Quantification for Text Classification
    Zhang, Dell
    Sensoy, Murat
    Makrehchi, Masoud
    Taneva-Popova, Bilyana
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT III, 2023, 13982 : 362 - 369
  • [45] Automatic Classification of Text Complexity
    Santucci, Valentino
    Santarelli, Filippo
    Forti, Luciana
    Spina, Stefania
    APPLIED SCIENCES-BASEL, 2020, 10 (20): : 1 - 19
  • [46] Text classification using capsules
    Kim, Jaeyoung
    Jang, Sion
    Park, Eunjeong
    Choi, Sungchul
    NEUROCOMPUTING, 2020, 376 : 214 - 221
  • [47] Chinese Text Classification Review
    Ma, Yajing
    Li, Yonghong
    Zhang, Xiang
    Wu, Xiaolong
    2018 NINTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME 2018), 2018, : 737 - 739
  • [48] Text Classification on a Grid Environment
    Roncero, Valeriana G.
    Costa, Myrian C. A.
    Ebecken, Nelson F. F.
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2010, 2011, 6449 : 251 - 262
  • [49] Classifying Text with Statistically Selected Features to Closely Related Categories
    Meena, M. Janaki
    Chandran, K. R.
    2009 INTERNATIONAL CONFERENCE ON ADVANCES IN RECENT TECHNOLOGIES IN COMMUNICATION AND COMPUTING (ARTCOM 2009), 2009, : 297 - +
  • [50] Arabic Text Classification: A Comparative Approach Using a Big Dataset
    Madhfar, Mokhtar Ali Hasan
    Al-Hagery, Mohammed Abdullah Hassan
    2019 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCIS), 2019, : 465 - 469