Proposed Architecture for Automatic Conversion of Unstructured Text Data into Structured Text Data on the Web

被引:0
作者
Madhusudhan, Ch. [1 ]
Rao, K. Mrithyunjaya [2 ]
机构
[1] St Johns Inst Sci & Technol, Vaagdevi Coll Engn & Technol, Dept MCA, Warangal, Andhra Prades, India
[2] St Johns Inst Sci & Technol, Vaagdevi Coll Engn & Technol, Dept CSE, Warangal, Andhra Prades, India
来源
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY | 2013年 / 13卷 / 12期
关键词
Data Mining; Text Mining; Text Classification; Text Mining Methods;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data mining, and in particular text mining, has attracted much attention in recent years due to the vast amounts of data available, and the rate of growth. Data mining tools can be used to uncover patterns or hidden relations in the available data, and can potentially contribute greatly to business strategy decisions, knowledge bases, and scientific and medical research. In contrast to data mining, where one looks for patterns and knowledge in structured databases, text mining deals with unstructured, or semi structured, text data such as reports, e-mails or web-pages.
引用
收藏
页码:110 / 116
页数:7
相关论文
共 16 条
  • [1] BAEZA-YATES R., 1990, MODERN INFORM RETRIE
  • [2] Berry M, 2003, SURVEY TEXT MINING C
  • [3] CHU W, 1996, COMMUNICATIONS ACM, V39
  • [4] DEERWESTER S, 1990, J AM SOC INFORM SCI, V41, P391, DOI 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO
  • [5] 2-9
  • [6] DUDA R. O., 2000, PATTERN CLASSIFICATI
  • [7] Dunham M.H., 2003, DATA MINING INTERDIC
  • [8] Fayyad U, 1996, ADV KNOWLEDGE DISCOV, P1
  • [9] Giuffrida G., 2000, P 12 INT C EXT DAT E
  • [10] GROTH R, 2000, DATA MINING BUILDING