Automatic Assamese Text Categorization Using WordNet

Cited by: 0
Authors
Sarmah, Jumi [1]
Barman, Anup Kumar [1]
Sarma, Shikhar Kr. [1]
Affiliation
[1] Gauhati Univ, Dept Informat Technol, Gauhati, India
Source
2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI) | 2013
Keywords
Text Categorization; Assamese WordNet; Word Sense Disambiguation
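DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract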
The growing volume of Assamese text available in digital form motivates a system that categorizes it automatically. This paper describes a system that categorizes texts automatically using knowledge from the Assamese WordNet. In WordNet, a synset groups the words that express the same concept; this approach disambiguates words that have more than one sense in a given text. It extracts the words that occur in a document and uses them, together with their corresponding synsets from WordNet, to build a synset vector. To improve performance, we present a process that increases the weight not only of the terms but also of the synsets corresponding to those terms. We then count the occurrences of the senses, which helps the disambiguation task by propagating the relationships between synsets. The proposed method yields reasonable, state-of-the-art accuracy when measured by precision and recall.
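The pipeline outlined in the abstract (extract words, disambiguate senses, count terms together with their synsets) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: `TOY_WORDNET`, `RELATED`, the English placeholder words, and the overlap-based scoring are all hypothetical stand-ins, since the abstract does not specify the Assamese WordNet interface or the exact weighting scheme.

```python
from collections import Counter

# Hypothetical toy lexicon standing in for Assamese WordNet:
# each word maps to the list of synset IDs (senses) it can express.
TOY_WORDNET = {
    "bank": ["bank.finance", "bank.river"],
    "loan": ["loan.finance"],
    "money": ["money.finance"],
    "river": ["river.water"],
    "water": ["water.liquid"],
}

# Hypothetical relations between synsets, used to propagate evidence.
RELATED = {
    "bank.finance": {"loan.finance", "money.finance"},
    "bank.river": {"river.water", "water.liquid"},
}


def disambiguate(word, tokens):
    """Pick the sense of `word` whose related synsets occur most often
    among the senses of the other words in the document (assumed scoring)."""
    senses = TOY_WORDNET.get(word, [])
    if not senses:
        return None
    # Count every candidate synset contributed by the other tokens.
    context = Counter()
    for other in tokens:
        if other != word:
            context.update(TOY_WORDNET.get(other, []))

    def score(sense):
        return sum(context[s] for s in RELATED.get(sense, ()))

    # max() keeps the first sense on ties, so the result is deterministic.
    return max(senses, key=score)


def synset_vector(tokens):
    """Count each term and, when one is found, its disambiguated synset."""
    vector = Counter()
    for tok in tokens:
        vector["term:" + tok] += 1
        sense = disambiguate(tok, tokens)
        if sense is not None:
            vector["synset:" + sense] += 1
    return vector


if __name__ == "__main__":
    print(synset_vector(["bank", "loan", "money", "bank"]))
```

In this sketch a document's vector holds both term counts and synset counts, so two documents that use different words from the same synset would still share features; a classifier of any kind could then be trained on such vectors.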
Pages: 85-89
Page count: 5