Automated Chinese Essay Scoring From Topic Perspective Using Regularized Latent Semantic Indexing

被引:5
作者
Hao, Shudong [1 ]
Xu, Yanyan [1 ]
Peng, Hengli [2 ]
Su, Kaile [3 ,4 ]
Ke, Dengfeng [5 ]
机构
[1] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing, Peoples R China
[2] Beijing Language & Culture Univ, Inst Educ Measurement, Beijing, Peoples R China
[3] Zhejiang Normal Univ, Coll Math Phys & Informat Engn, Jinhua, Zhejiang, Peoples R China
[4] Griffith Univ, Inst Integrated & Intelligent Syst, Brisbane, Qld 4111, Australia
[5] Chinese Acad Sci, Inst Automaton, Beijing, Peoples R China
来源
2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2014年
关键词
document understanding; automated Chinese essay scoring; topic modeling application; classification application;
D O I
10.1109/ICPR.2014.533
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding out an effective way to score Chinese written essays automatically remains challenging for researchers. Several methods have been proposed and developed but limited in the character and word usage levels. As one of the scoring standards, however, content or topic perspective is also an important and necessary indicator to assess an essay. Therefore, in this paper, we propose a novel perspective - topic, and a new method integrating topic modeling strategy called Regularized Latent Semantic Indexing to recognize the latent topics and Support Vector Machines to train the scoring model. Experimental results show that automated Chinese essay scoring from topic perspective is effective which can improve the rating agreement to 89%.
引用
收藏
页码:3092 / 3097
页数:6
相关论文
共 22 条
  • [1] [Anonymous], 2001, Recent Advances in Natural language processing-RANLP'2001
  • [2] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [3] Cao Y., 2007, Exam. Res, V3, P63
  • [4] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [5] SUPPORT-VECTOR NETWORKS
    CORTES, C
    VAPNIK, V
    [J]. MACHINE LEARNING, 1995, 20 (03) : 273 - 297
  • [6] Dikli S., 2006, The Journal of Technology, Learning and Assessment, V5
  • [7] PATHWISE COORDINATE OPTIMIZATION
    Friedman, Jerome
    Hastie, Trevor
    Hoefling, Holger
    Tibshirani, Robert
    [J]. ANNALS OF APPLIED STATISTICS, 2007, 1 (02) : 302 - 332
  • [8] Regularization Paths for Generalized Linear Models via Coordinate Descent
    Friedman, Jerome
    Hastie, Trevor
    Tibshirani, Rob
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01): : 1 - 22
  • [9] Automated Error Detection and Correction of Chinese Characters in Written Essays Based on Weighted Finite-State Transducer
    Hao, Shudong
    Gao, Zongtian
    Zhang, Mingqing
    Xu, Yanyan
    Peng, Hengli
    Su, Kaile
    Ke, Dengfeng
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 763 - 767
  • [10] Ke Dengfeng, 2011, P NAT C MAN MACH SPE, P25