Graph based KNN for Text Categorization

被引:0
作者
Jo, Taeho [1 ]
机构
[1] Hongik Univ, Sch Games, Sejong, South Korea
来源
2018 20TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT) | 2018年
关键词
Text Categorization; Graph Similarity; Graph based KNN;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this research, we propose the graph based KNN where a graph is given as input, instead of a numerical vector, as the approach to the text categorization tasks. The ontology which is given as a graph has been used as the popular and standard knowledge representation which is understandable by computers, so it is regarded as more natural scheme to encode texts into graphs, than numerical vectors. In this research, we encode texts into graphs, define the similarity measure between graphs, and modify the K Nearest Neighbor into its graph based version as the text categorization tool. As the benefit from this research, we expect the more compact, graphical, and symbolic representation of texts, than numerical vectors. Therefore, the goal of this research is to implement the text categorization system with the better performance and more user-friendly representations of texts
引用
收藏
页码:260 / 265
页数:6
相关论文
共 15 条
  • [11] Noy N.F., 1997, AI MAGAZINE, V18
  • [12] Machine learning in automated text categorization
    Sebastiani, F
    [J]. ACM COMPUTING SURVEYS, 2002, 34 (01) : 1 - 47
  • [13] Taeho Jo, 2010, Journal of Computing Science and Engineering, V4, P110
  • [14] Taeho Jo, 2010, International Journal of Information Studies, V2, P83
  • [15] 조태호, 2008, [JOURNAL OF KOREA MULTIMEDIA SOCIETY, 멀티미디어학회논문지], V11, P1749