Information Extraction System Based on Hidden Markov Model

被引:0
作者
Park, Dong-Chul [1 ]
Huong, Vu Thi Lan [1 ]
Woo, Dong-Min [1 ]
Hieu, Duong Ngoc [2 ]
Ninh, Sai Thi Hien [2 ]
机构
[1] Myong Ji Univ, Dept Informat Engn, Yongin, South Korea
[2] Ho Chi Minh City Univ Technol, Fac Comp Sci & Engn, Ho Chi Minh, Vietnam
来源
ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 1, PROCEEDINGS | 2009年 / 5551卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A novel approach using Hidden Markov Model (HMM) for the task of finding prices of products on internet sites is proposed in this paper. The proposed Information Extraction System based on HMM (IESHMM) utilizes HMM for its capability to process temporal information. The proposed IESHMM first processes web pages that are returned from search engines and then extracts specific fields Such as prices, descriptions, locations, images of products, and other information of interest. The proposed IESHMM is evaluated with real-world problems and compared with a conventional method. The results show that the proposed IESHMM outperforms the other method by 22.9% and 37.2% in terms of average recall and average precision, respectively.
引用
收藏
页码:52 / +
页数:2
相关论文
共 9 条
[1]  
BUTTLER D, 2001, P IEEE ICDCS, V361
[2]  
Chang C., 2001, P WORLD WID WEB C, P682
[3]  
Freitag D., 1999, A A A I Workshop on Machine Learning for Information Extraction, P31
[4]  
Leek T.R., 1997, THESIS UC SAN DIEGO
[5]  
Phan XH, 2004, IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2004), PROCEEDINGS, P590
[6]   A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION [J].
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1989, 77 (02) :257-286
[7]   A CRITICAL INVESTIGATION OF RECALL AND PRECISION AS MEASURES OF RETRIEVAL-SYSTEM PERFORMANCE [J].
RAGHAVAN, VV ;
JUNG, GS ;
BOLLMANN, P .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1989, 7 (03) :205-229
[8]  
Rilo E., 1999, P 16 NAT C ART INT, P811
[9]   Learning information extraction rules for semi-structured and free text [J].
Soderland, S .
MACHINE LEARNING, 1999, 34 (1-3) :233-272