Automatic Assamese Text Categorization Using WordNet

被引:0
|
作者
Sarmah, Jumi [1 ]
Barman, Anup Kumar [1 ]
Sarma, Shikhar Kr. [1 ]
机构
[1] Gauhati Univ, Dept Informat Technol, Gauhati, India
来源
2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI) | 2013年
关键词
Text Categorization; Assamese WordNet; Word Sense Disambiguation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The increasing rate of Assamese text contents in digital format encourages us to generate a system that automatically categorizes them. This paper discusses a system that will perform the categorization of texts automatically based on the knowledge from Assamese WordNet. In WordNet, synset correspond to the words which implies the same concept and words having more than one sense in a particular text content is disambiguated in this approach. This approach extracts words occurred in the document and uses them to create a synset vector with union to its corresponding synsets from WordNet. To increase our performance, we present a process where it increases the weight of not only the terms but also that of the synsets corresponding to the terms. We later count the occurrences of the senses that help in disambiguation tasks by propagating the relationship between synsets. The proposed method outcomes with a reasonable state of art accuracy when measured with Precision and Recall.
引用
收藏
页码:85 / 89
页数:5
相关论文
共 50 条
  • [11] Automatic categorization of web text documents using fuzzy inference rule
    Ankita Dhar
    Himadri Mukherjee
    Niladri Sekhar Dash
    Kaushik Roy
    Sādhanā, 2020, 45
  • [12] Automatic categorization of web text documents using fuzzy inference rule
    Dhar, Ankita
    Mukherjee, Himadri
    Dash, Niladri Sekhar
    Roy, Kaushik
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2020, 45 (01):
  • [13] Automatic text categorization based on angle distribution
    Liu, T
    Guo, J
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 3797 - 3801
  • [14] Automatic expert identification using a text categorization technique in knowledge management systems
    Yang, Kun-Woo
    Huh, Soon-Young
    EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (02) : 1445 - 1455
  • [15] Research on Chinese Text Automatic Categorization Based on VSM
    Tong Xiao-Jun
    Cui Ming-Gen
    Song Guo-Long
    2007 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-15, 2007, : 3863 - +
  • [16] Automatic Multilabel Categorization using Learning to Rank Framework for Complaint Text on Bandung Government
    Fauzan, Ahmad
    Khodra, Masayu Leylia
    2014 INTERNATIONAL CONFERENCE OF ADVANCED INFORMATICS: CONCEPT, THEORY AND APPLICATION (ICAICTA), 2014, : 28 - 33
  • [17] A Comprehensive Analysis of using Semantic Information in Text Categorization
    Celik, Kerem
    Gungor, Tunga
    2013 IEEE INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (IEEE INISTA), 2013,
  • [18] Supervised and Traditional Term Weighting Methods for Automatic Text Categorization
    Lan, Man
    Tan, Chew Lim
    Su, Jian
    Lu, Yue
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (04) : 721 - 735
  • [19] Automatic Category Theme Identification and Hierarchy Generation for Chinese Text Categorization
    Hsin-Chang Yang
    Chung-Hong Lee
    Journal of Intelligent Information Systems, 2005, 25 : 47 - 67
  • [20] Automatic text categorization based on content analysis with cognitive situation models
    Guo, Yi
    Shao, Zhiqing
    Hua, Nan
    INFORMATION SCIENCES, 2010, 180 (05) : 613 - 630