Chinese Question Classification Based on Semantic Gram and SVM

被引:1
作者
Wang, Liang [1 ]
Zhang, Hui [1 ]
Wang, Deqing [1 ]
Huang, Jia [1 ]
机构
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China
来源
2009 INTERNATIONAL FORUM ON COMPUTER SCIENCE-TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS | 2009年
关键词
Chinese question classification; SVM; feature extraction; semantic gram;
D O I
10.1109/IFCSTA.2009.111
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Question classification plays a crucial important role in the question answering system. Recent research on question classification for open-domain mostly concentrates on using machine learning methods to resolve the special kind of text classification. This paper presents our research about Chinese question classification using machine learning method and gives our approach based on SVM and semantic gram extraction. SVM has been widely used for question classification and got good performances. We use SVM as the classifier and propose a new feature extraction method of Chinese questions which is called semantic gram extraction. The method is proposed based on the word semantics and N-gram. The experiment results show that the feature extraction can perform well with SVM and our approach can reach high classification accuracy.
引用
收藏
页码:432 / 435
页数:4
相关论文
共 10 条
  • [1] Cristianini N., 2000, INTRO SUPPORT VECTOR
  • [2] DELLZHANG, 2003, P SIGIR 03 TOR CAN
  • [3] DONG Z., 2006, Hownet and the computation of meaning
  • [4] JIA KL, 2007, P 8 ACIS INT C SOFTW
  • [5] KADRI H, 2003, P HLTNACCL 2003 EDM, P28
  • [6] Liu Q., 2002, COMPUTATIONAL LINGUI
  • [7] Prager John, 2006, Foundations and Trends in Information Retrieval, V1, P91, DOI 10.1561/1500000001
  • [8] SUZUKI TJ, 2003, P 2003 ACL WORKSH MU, P61
  • [9] Xu W, 2006, RES QUESTION CLASSIF
  • [10] ZHANG HP, 2003, P 2 SIGHAN WORKSH AF