A Conceptual Design of a Web Information Extraction and Data Analysis Learning Framework

被引:0
作者
Tseng, Chun-Hsiung [1 ]
Chen, Yung-Hui [2 ]
Jiang, Yan-Ru [1 ]
机构
[1] Nanhua Univ, Dept Informat Management, Dalin, Chiayi County, Taiwan
[2] Lunghwa Univ Sci & Technol, Dept Comp Informat & Network Engn, Taoyuan, Taiwan
来源
2015 8TH INTERNATIONAL CONFERENCE ON UBI-MEDIA COMPUTING (UMEDIA) CONFERENCE PROCEEDINGS | 2015年
关键词
Information Extraction; Learning; DataAnalyzation; Data Mining;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Web is flooded with data. However, there is a huge gap between data and information. Collecting, normalization, and analyzation are required steps to transform data into information. However, HTML is document-centric rather than data-centric. Extracting large amounts of data from the Web is a time consuming and tedious task, but information technologies can only provide little help, especially when users lack of domain knowledge. In this research, the conceptual design of a Web information extraction and data analysis framework is proposed. The framework helps data analysts go through the required steps. Furthermore, our design is suitable for inexperienced beginners in data analyzation field since some assistant modules have been considered.
引用
收藏
页码:124 / 127
页数:4
相关论文
共 12 条
  • [1] Hu Xiaohua, 2004, ONTOLOGY BASED SCALA
  • [2] Karacapilidis Nikos, 2014, STRENGTHENING COLLAB, P1005
  • [3] Kathi Sheetal, 2014, PARALLEL PREPROCESSI
  • [4] Nguyen Minh-Tien, 2013, EXTRACTION DIS EVENT, P139
  • [5] Padmadas V, 2010, WEB DATA EXTRACION U, P218
  • [6] Patii Ujwala Manoj, 2012, WEB DATA MINING TREN, P961
  • [7] Raghavan Sindhu, 2012, LEARNING READ LINES, P349
  • [8] Sghaier Manei, 2008, FFTM OPTIMIZED FREQU, P419
  • [9] Song Min, 2003, KPSPOTTER FLEXIBLE I, P50
  • [10] Tseng Chun Hsiung, 2014, P UB MED COMP UMEDIA, P300