The Research and Design of Web Text Mining System Framework

被引:0
作者
Meng, Fanrong [1 ]
Jiang, Xiaoyun [1 ]
Shen, Lijun
Shi, Lei
机构
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221008, Jiangsu, Peoples R China
来源
DCABES 2008 PROCEEDINGS, VOLS I AND II | 2008年
基金
中国国家自然科学基金;
关键词
web data mining; maximum matching method; vector space model; text classification; web text mining system framework;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the flood of the data on the Web, Web data mining has become the focus of the data mining technology This paper introduce the conception of Web Mining, analysis the difference between Web Mining and Data Mining. On the base of improving Maximum Matching Method, studying Vector Space Model and text classification algorithm, provide a Web text mining system framework, design the module of the framework and validate the system lastly.
引用
收藏
页码:400 / +
页数:3
相关论文
共 11 条
[1]  
CHEN L, 2001, XI DIAN U SCI, V28, P114
[2]   Automatic Labeling of semantic roles [J].
Gildea, D ;
Jurafskyy, D .
COMPUTATIONAL LINGUISTICS, 2002, 28 (03) :245-288
[3]  
HAN JW, 2001, J COMPUTER RES DEV, V38, P405
[4]  
KINGSBURY P, 2003, P TREEBANKS LEXICAL
[5]  
Kosala Raymond., 2000, SIGKDD EXPLOR NEWSL, V2, P1, DOI DOI 10.1145/360402.360406
[6]  
LIU Z, 2004, AUTOMATIC CHINESE TE, P20
[7]  
MARKOV A, 2006, P WEBKDD 2006 PHIL P
[8]  
MLADENIC D, P TEXT MIN WORKSH 10
[9]  
STEINBACH M, 2000, KNOWLEDGE DISCOVERY
[10]  
STREHL A, 2000, P AAAI WORKSH AI WEB, P58