Automatic Assamese Text Categorization Using WordNet

被引:0
|
作者
Sarmah, Jumi [1 ]
Barman, Anup Kumar [1 ]
Sarma, Shikhar Kr. [1 ]
机构
[1] Gauhati Univ, Dept Informat Technol, Gauhati, India
来源
2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI) | 2013年
关键词
Text Categorization; Assamese WordNet; Word Sense Disambiguation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The increasing rate of Assamese text contents in digital format encourages us to generate a system that automatically categorizes them. This paper discusses a system that will perform the categorization of texts automatically based on the knowledge from Assamese WordNet. In WordNet, synset correspond to the words which implies the same concept and words having more than one sense in a particular text content is disambiguated in this approach. This approach extracts words occurred in the document and uses them to create a synset vector with union to its corresponding synsets from WordNet. To increase our performance, we present a process where it increases the weight of not only the terms but also that of the synsets corresponding to the terms. We later count the occurrences of the senses that help in disambiguation tasks by propagating the relationship between synsets. The proposed method outcomes with a reasonable state of art accuracy when measured with Precision and Recall.
引用
收藏
页码:85 / 89
页数:5
相关论文
共 50 条
  • [1] Using WordNet for text categorization
    Elberrichi, Zakaria
    Rahmoun, Abdelattif
    Bentaalah, Mohamed Amine
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2008, 5 (01) : 16 - 24
  • [2] Fully Automatic Text Categorization by Exploiting WordNet
    Li, Jianqiang
    Zhao, Yu
    Liu, Bo
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2009, 5839 : 1 - 12
  • [3] Using kNN model for automatic text categorization
    Guo, GD
    Wang, H
    Bell, D
    Bi, YX
    Greer, K
    SOFT COMPUTING, 2006, 10 (05) : 423 - 430
  • [4] Using kNN model for automatic text categorization
    Gongde Guo
    Hui Wang
    David Bell
    Yaxin Bi
    Kieran Greer
    Soft Computing, 2006, 10 : 423 - 430
  • [5] A WordNet-based approach to feature selection in text categorization
    Zhang, K
    Sun, J
    Wang, B
    INTELLIGENT INFORMATION PROCESSING II, 2005, 163 : 475 - 484
  • [6] Automatic Text Categorization of Marathi Documents Using Clustering Technique
    Vispute, Sushma R.
    Potey, M. A.
    2013 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING TECHNOLOGIES (ICACT), 2013,
  • [7] Automatic text categorization and its application to text retrieval
    Lam, W
    Ruiz, M
    Srinivasan, P
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1999, 11 (06) : 865 - 879
  • [8] Automatic text categorization with learning logic
    Al-Mubaid, H
    Siddiqui, MS
    COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 2003, : 178 - 183
  • [9] Automatic text categorization:: Case study
    Corrêa, RF
    Ludermir, TB
    VII BRAZILIAN SYMPOSIUM ON NEURAL NETWORKS, PROCEEDINGS, 2002, : 150 - 150
  • [10] WordNet Based Information Retrieval System for Assamese
    Barman, Anup Kumar
    Sarmah, Jumi
    Sarma, Shikhar Kr.
    UKSIM-AMSS 15TH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION (UKSIM 2013), 2013, : 480 - 484