Text Categorization of Marathi Documents using Modified LINGO

Cited by: 0
Authors
Narhari, Shraddha A. [1]
Shedge, Rajashree [1]
Affiliation
[1] Ramrao Adik Inst Technol, Dept Comp Engn, Mumbai, Maharashtra, India
Source
2017 IEEE INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND CONTROL (ICAC3) | 2017
Keywords
Text Categorization; Dimension reduction; Morphological analysis; LINGO; SVD; PCA;
DOI
Not available
CLC classification number
TP [Automation Technology, Computer Technology];
Subject classification code
0812;
Abstract
The 21st century has seen tremendous growth in the number of internet users. At the same time, users across the world increasingly prefer regional languages when accessing the internet. Because search engines already support many regional languages, there is considerable scope for text categorization in these languages. Moreover, rapid advances in information technology have driven a variety of applications of text categorization. We have chosen Marathi as the regional language for automatic text categorization. Categorization encompasses various classification and clustering techniques, and Label Induction Grouping (LINGO) is one such algorithm, used here for categorizing Marathi text documents. LINGO uses Singular Value Decomposition (SVD) for dimension reduction. We propose a modified LINGO algorithm that takes the morphology of Marathi text into account and uses Principal Component Analysis (PCA) to improve results.
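The two dimension-reduction steps named in the abstract can be illustrated with a minimal sketch. This is not the authors' modified LINGO: the toy term-document matrix, the function names `svd_reduce` and `pca_reduce`, and the choice of k = 2 are all assumptions made for illustration, and the morphological preprocessing of Marathi text is omitted entirely.

```python
import numpy as np


def svd_reduce(term_doc, k):
    """Represent each document in the space of the top-k singular vectors.

    term_doc has one row per term and one column per document.
    """
    u, s, vt = np.linalg.svd(term_doc, full_matrices=False)
    # Each column of the result is one document in k dimensions.
    return np.diag(s[:k]) @ vt[:k, :]


def pca_reduce(term_doc, k):
    """PCA over documents: centre every term across documents, then project
    onto the top-k principal directions found by SVD of the centred matrix."""
    centred = term_doc - term_doc.mean(axis=1, keepdims=True)
    u, s, vt = np.linalg.svd(centred, full_matrices=False)
    return u[:, :k].T @ centred


# Hypothetical term-document counts (5 terms x 4 documents); values are made up.
A = np.array(
    [
        [2.0, 1.0, 0.0, 0.0],
        [1.0, 2.0, 0.0, 1.0],
        [0.0, 0.0, 3.0, 1.0],
        [0.0, 1.0, 2.0, 2.0],
        [1.0, 0.0, 0.0, 1.0],
    ]
)

docs_svd = svd_reduce(A, 2)
docs_pca = pca_reduce(A, 2)
```

In this sketch the only difference between the two reductions is the centring step: PCA subtracts each term's mean across documents before decomposing, while plain SVD works on the raw counts.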
Pages: 5
Related papers
18 in total
[1]  
[Anonymous], 2014, International Journal of Artificial Intelligence & Applications (IJAIA)
[2]  
Choudhary Narayan, 2014, INT J EMERGING TECHN
[3]  
Desai Nikita P., 2015, SURVEY TEXT CATEGORI, V43, P118
[4]  
Durga A.K., 2011, International Journal of Scientific and Engineering Research, V2, P1
[5]  
Govilkar Sharvari, 2016, INT J COMPUTER APPL
[6]  
Hanumanthappa M., 2014, INT C INT COMP APPL, V37, P30
[7]  
Kannan S., 2014, Preprocessing techniques for text mining
[8]  
Krail N., 2012, 3rd Workshop on South and Southeast Asian Natural Language Processing (WSSANLP@COLING), P109
[9]  
Muley Aditi, 2014, INT J COMPUTER SCI C, V4, P13
[10]  
Niharika S., 2012, INT J COMPUTER TREND, V3, P39