Exploiting meta features for dependency parsing and part-of-speech tagging

Cited by: 11
Authors
Chen, Wenliang [1]
Zhang, Min [1]
Zhang, Yue [2]
Duan, Xiangyu [1]
Affiliations
[1] Soochow University, School of Computer Science and Technology, Suzhou, People's Republic of China
[2] Singapore University of Technology and Design, Singapore, Singapore
Funding
National Natural Science Foundation of China
Keywords
Dependency parsing; Natural language processing; Meta-features; Part-of-speech tagging; Semi-supervised approach
DOI
10.1016/j.artint.2015.09.002
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, discriminative methods have achieved much progress in natural language processing tasks such as parsing, part-of-speech tagging, and word segmentation. For these methods, conventional features in a relatively high-dimensional feature space may suffer from sparseness and thus exhibit less discriminative power on unseen data. This article presents a learning framework of feature transformation that addresses the sparseness problem by transforming sparse conventional base features into less sparse high-level features (i.e., meta features) with the help of a large amount of automatically annotated data. The meta features are derived by bucketing similar base features according to their frequency in the large data, and are used together with the base features in our final system. We apply the framework to part-of-speech tagging and dependency parsing. Experimental results show that our systems perform better than the baseline systems in both tasks on standard evaluation. For the dependency parsing task, our parsers achieve state-of-the-art accuracy on the Chinese data and accuracy comparable with the best known systems on the English data. Further analysis indicates that our proposed approach is effective in processing unseen data and features. © 2015 Elsevier B.V. All rights reserved.
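The bucketing step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state bucket boundaries or naming, so the 10%/30% rank cutoffs, the labels H/M/L/O, the `(template, value)` feature representation, and the function names `build_bucket_map` and `add_meta_features` are all assumptions made here for illustration.

```python
from collections import Counter

def build_bucket_map(auto_features, high_pct=10, mid_pct=30):
    """Assign each base feature seen in the auto-annotated data to a frequency bucket.

    The cutoffs (top 10% -> "H", next 20% -> "M", rest -> "L") are illustrative
    assumptions, not values taken from the abstract.
    """
    counts = Counter(auto_features)
    # Rank distinct features by descending count; break ties by the feature itself.
    ranked = sorted(counts, key=lambda f: (-counts[f], f))
    n = len(ranked)
    high_cut = n * high_pct // 100  # integer cutoffs avoid float rounding surprises
    mid_cut = n * mid_pct // 100
    bucket_map = {}
    for rank, feat in enumerate(ranked):
        if rank < high_cut:
            bucket_map[feat] = "H"
        elif rank < mid_cut:
            bucket_map[feat] = "M"
        else:
            bucket_map[feat] = "L"
    return bucket_map

def add_meta_features(base_features, bucket_map):
    """Return the base features followed by one meta feature per base feature."""
    meta = []
    for template, value in base_features:
        # "O" (hypothetical label): base feature never seen in the large data.
        label = bucket_map.get((template, value), "O")
        meta.append((template + "_meta", label))
    return list(base_features) + meta

# Toy stand-in for base features extracted from automatically annotated text.
auto_features = (
    [("w0", "the")] * 5 + [("w0", "dog")] * 3 + [("w0", "cat")] * 2
    + [("w0", w) for w in ["ant", "bee", "elk", "fox", "hen", "owl", "yak"]]
)
bucket_map = build_bucket_map(auto_features)

# One training instance: a word seen in the large data and an unseen word.
instance = [("w0", "dog"), ("w0", "zebra")]
features = add_meta_features(instance, bucket_map)
print(features)
```

In this sketch every rare or unseen word maps to the same small set of meta features, so a discriminative model can learn a weight for, say, `("w0_meta", "O")` even though the base feature `("w0", "zebra")` never occurs in the labeled training data. That is the sense in which meta features are less sparse than base features.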
Pages: 173-191
Number of pages: 19