KNN based machine learning approach for text and document mining

Cited by: 0
Authors
Bijalwan, Vishwanath [1]
Kumar, Vinay [2]
Kumari, Pinki [3]
Pascual, Jordan [4]
Affiliations
[1] Institute of Technology Gopeshwar, Chamoli, Uttarakhand
[2] GLA University, Mathura
[3] Banasthali University, Rajasthan
[4] Department of Computer Science, University of Oviedo
Source
International Journal of Database Theory and Application | 2014 / Vol. 7 / No. 1
Keywords
Document mining; Event models; KNN; Machine learning; Naïve Bayes; Term-graph; Text mining
DOI
10.14257/ijdta.2014.7.1.06
Abstract
Text Categorization (TC), also known as Text Classification, is the task of automatically assigning each document in a collection to one or more categories from a predefined set. If each document belongs to exactly one category, it is a single-label classification task; otherwise, it is a multi-label classification task. TC draws on several tools from Information Retrieval (IR) and Machine Learning (ML) and has received much attention in recent years from both academic researchers and industry developers. In this paper, we first categorize the documents using a KNN-based machine learning approach and then return the most relevant documents. © 2014 SERSC.
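The abstract does not spell out the pipeline, so the following is only a minimal sketch of what single-label KNN text categorization followed by relevance ranking might look like. It is not the authors' implementation. The TF-IDF weighting, the cosine similarity measure, the majority vote, the value of k, the function names, and the toy documents are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and keep alphabetic runs only (an assumed, very simple tokenizer).
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    # Build sparse TF-IDF vectors (dicts) with a smoothed IDF.
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    df = Counter(term for toks in tokenized for term in set(toks))
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    vectors = [{t: tf * idf[t] for t, tf in Counter(toks).items()}
               for toks in tokenized]
    return vectors, idf


def cosine(a, b):
    # Cosine similarity between two sparse vectors.
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_classify(query, docs, labels, k=3):
    # Step 1: categorize the query by majority vote among its k nearest documents.
    vectors, idf = tfidf_vectors(docs)
    q = {t: tf * idf[t]
         for t, tf in Counter(tokenize(query)).items() if t in idf}
    ranked = sorted(range(len(docs)),
                    key=lambda i: cosine(q, vectors[i]), reverse=True)
    votes = Counter(labels[i] for i in ranked[:k])
    label = votes.most_common(1)[0][0]
    # Step 2: return documents of the predicted category, most similar first.
    relevant = [docs[i] for i in ranked if labels[i] == label]
    return label, relevant


# Hypothetical toy corpus, not taken from the paper.
DOCS = [
    "the team won the football match",
    "the striker scored a goal in the match",
    "the player scored in the final game",
    "the election results were announced today",
    "the senate passed the new budget law",
    "the minister gave a speech on the election",
]
LABELS = ["sports", "sports", "sports", "politics", "politics", "politics"]

if __name__ == "__main__":
    label, relevant = knn_classify(
        "who scored the goal in the football match", DOCS, LABELS, k=3)
    print(label)
    print(relevant[0])
```

In this sketch the vote is unweighted; weighting each neighbor's vote by its similarity is a common variation, and a multi-label setting would need a per-category threshold instead of a single majority label.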
Pages: 61-70
Page count: 9