Ancient medical literature semantic annotation using hidden markov models

被引:0
作者
Weng, Heng [1 ]
He, Wenxin [1 ]
Ou, Aihua [1 ]
Deng, Lili [1 ]
He, Chong [2 ]
Li, Huihui [2 ]
Yan, Shixing [2 ]
机构
[1] Guangdong Prov Hosp Tradit Chinese Med, Dept Big Med Data, Guangzhou, Guangdong, Peoples R China
[2] Tongji Univ, Dept Control Sci & Engn, Shanghai 201804, Peoples R China
来源
2014 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM) | 2014年
关键词
ancient literature; traditional Chinese medicine; semantic annotation; hidden markov models;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Traditional Chinese medicine (TCM) has accumulated amount of literature with a total of 1,059 volumes, more than 190,000 chapters, and more than 120,000,000 words during the last 2000 years. In the previous works, researchers annotated the phrases one by one with their own hands. Here we propose semantic annotation techniques based on Semantic units division and annotation are realized through constructing a corpus and professional semantic unit dictionary. Based on the technology, a semantic annotation method is implemented using hidden markov models, which achieves 92.2% in terms of micro-average F1 measure and 87.6% in terms of macro-average F1 measure on the case of spleen putty genre.
引用
收藏
页数:4
相关论文
共 9 条
[1]   Hidden markov models for text categorization in multi-page documents [J].
Frasconi, P ;
Soda, G ;
Vullo, A .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2002, 18 (2-3) :195-217
[2]  
Freitag D, 2000, SEVENTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-2001) / TWELFTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-2000), P584
[3]   Combining hidden Markov models and latent semantic analysis for topic segmentation and labeling: Method and clinical application [J].
Ginter, Filip ;
Suominen, Hanna ;
Pyysalo, Sampo ;
Salakoski, Tapio .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2009, 78 (12) :E1-E6
[4]   Using Latent Semantic Analysis and the Predication Algorithm to Improve Extraction of Meanings from a Diagnostic Corpus [J].
Jorge-Botana, Guillermo ;
Olmos, Ricardo ;
Antonio Leon, Jose .
SPANISH JOURNAL OF PSYCHOLOGY, 2009, 12 (02) :424-440
[5]  
[李开荣 Li Kairong], 2012, [微电子学与计算机, Microelectronics & Computer], V29, P161
[6]   Improve computer-aided diagnosis with machine learning techniques using undiagnosed samples [J].
Li, Ming ;
Zhou, Zhi-Hua .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2007, 37 (06) :1088-1098
[7]  
Liu H., 2007, LIB INFORM SERVICE, V51, P41
[8]   Implementation of a Tour Guide Robot System Using RFID Technology and Viterbi Algorithm-Based HMM for Speech Recognition [J].
Pai, Neng-Sheng ;
Kuang, Hua-Jui ;
Chang, Ting-Yuan ;
Kuo, Ying-Che ;
Lai, Chun-Yuan .
MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
[9]  
YANG J, 2010, J COMPUTER APPL, V09