Graph based KNN for Text Categorization

Times cited: 0
Author
Jo, Taeho [1]
Affiliation
[1] Hongik Univ, Sch Games, Sejong, South Korea
Source
2018 20TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT) | 2018
Keywords
Text Categorization; Graph Similarity; Graph based KNN;
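DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic and Communication Technology];
Discipline classification code
0808 ; 0809 ;
Abstract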
In this research, we propose a graph based version of KNN (K Nearest Neighbor), which takes a graph rather than a numerical vector as its input, as an approach to text categorization tasks. Ontologies, which are given as graphs, have been a popular and standard form of knowledge representation that computers can process, so encoding texts into graphs is regarded as a more natural scheme than encoding them into numerical vectors. We encode texts into graphs, define a similarity measure between graphs, and modify KNN into its graph based version as the text categorization tool. As a benefit, we expect a more compact, graphical, and symbolic representation of texts than numerical vectors provide. The goal of this research is therefore to implement a text categorization system with better performance and more user-friendly representations of texts.
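The three steps named in the abstract (encoding texts into graphs, measuring similarity between graphs, and classifying with KNN) can be illustrated with a minimal Python sketch. The abstract does not specify the encoding scheme or the similarity measure, so everything here is an assumption made for illustration only: edges link adjacent words, similarity is the Jaccard overlap of edge sets, and the function names `text_to_graph`, `graph_similarity`, and `knn_classify` are hypothetical. This is not the author's method.

```python
from collections import Counter


def text_to_graph(text):
    """Encode a text as a set of undirected edges between adjacent words.

    Assumed encoding for illustration; the paper's scheme is not given
    in the abstract.
    """
    words = text.lower().split()
    return {frozenset((a, b)) for a, b in zip(words, words[1:]) if a != b}


def graph_similarity(g1, g2):
    """Jaccard overlap of two edge sets (an assumed similarity measure)."""
    if not g1 and not g2:
        return 0.0
    return len(g1 & g2) / len(g1 | g2)


def knn_classify(query, training, k=3):
    """Label a query text by majority vote among its k most similar graphs.

    `training` is a list of (text, label) pairs.
    """
    query_graph = text_to_graph(query)
    scored = sorted(
        ((graph_similarity(query_graph, text_to_graph(text)), label)
         for text, label in training),
        key=lambda pair: pair[0],
        reverse=True,
    )
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


# Toy data, invented for this sketch.
training = [
    ("stock market prices fall", "business"),
    ("stock market prices rise", "business"),
    ("team wins football match", "sports"),
    ("football match ends in draw", "sports"),
]
print(knn_classify("stock market prices drop", training, k=3))
```

Only the similarity function depends on the graph representation, so a different graph encoding or similarity measure could be swapped in without changing the voting step.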
Pages: 260-265
Page count: 6