Machine Learning for Automated Tender Classification

被引:0
作者
Goswami, Sumit [1 ]
Kapoor, Sunaina [2 ]
Bhardwaj, Prakriti [3 ]
机构
[1] DRDO, Dte Mgmt Informat Syst & Technol MIST, New Delhi, India
[2] Indira Gandhi Inst Technol, Dept Comp Sci & Engn, Delhi, India
[3] JSS Acad Tech Educ, Dept Informat Technol, Noida, India
来源
2011 ANNUAL IEEE INDIA CONFERENCE (INDICON-2011): ENGINEERING SUSTAINABLE SOLUTIONS | 2011年
关键词
tenderdtocument classification; machine learning; text mining; naive bayes classifier; DRDO;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper presents classification of DRDO tender documents into predefined categories. Since there is a consistent growth in the volume of digital documents, both on the internet and within organizations, the need to classify them into categories is obvious. In this paper we used `bag-of-words' technique to represent the tender documents. The dataset was prepared and fed into Weka toolkit. Classification was implemented by Naive Bayes classifier using 10-folds cross validation technique. The machine resulted in classifying the tender documents with an accuracy of 77.36% by technology category and 67.2% by lab's name.
引用
收藏
页数:4
相关论文
共 13 条
[1]  
Baeza-Yates R, 1999, MODERN INFORM RETRIE, V463
[2]  
BAI R, 2009, FOLKSONOMY BLOGOSPHE, V3, P631
[3]  
Bouckaert R.R., 2011, WEKA Manual for Version 3-7-4
[4]  
DEVI MI, 2007, APPLICATIONS, P116, DOI DOI 10.1109/ICCIMA.2007.342
[5]  
Eldén L, 2007, FUND ALGORITHMS, V4, pIX, DOI 10.1137/1.9780898718867
[6]  
Goswami S., 2009, Proceedings of the International AAAI Conference on Web and Social Media, V3, P214, DOI DOI 10.1609/ICWSM.V3I1.13992
[7]  
Lam W, 1997, IEEE SYS MAN CYBERN, P2719, DOI 10.1109/ICSMC.1997.635349
[8]   Classification of text documents [J].
Li, YH ;
Jain, AK .
COMPUTER JOURNAL, 1998, 41 (08) :537-546
[9]  
Rak R, 2005, ICMLA 2005: FOURTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, P177
[10]   COMPUTER EVALUATION OF INDEXING AND TEXT PROCESSING [J].
SALTON, G ;
LESK, ME .
JOURNAL OF THE ACM, 1968, 15 (01) :8-&