Part-Of-Speech Labeling for Reuters Database

被引:0
|
作者
Cretulescu, R. [1 ]
David, A. [1 ]
Morariu, D. [1 ]
Vintan, L. [1 ]
机构
[1] Lucian Blaga Univ Sibiu, Comp Sci & Elect Engn Dept, Sibiu, Romania
来源
2015 19TH INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC) | 2015年
关键词
Documents Representation; Vector Space Model; Tagging Algorithms; Part of Speech;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Even if the Vector Space Model used for document representation in information retrieval systems integrates a small quantity of knowledge it continues to be used due to its computational cost, speed execution and simplicity. We try to improve this document representation by adding some syntactic information such as the parts of speech. In this paper, we have evaluated three different tagging algorithms in order to select the most suitable tagger for using it to tag the Reuters dataset. In this work, we have evaluated the taggers using only five different parts of speech: noun, verb, adverb, adjective and others. We considered these particular tags being the most representative for describing the documents into these parts of speech space.
引用
收藏
页码:117 / 122
页数:6
相关论文
共 50 条
  • [1] Part-of-speech persistence: The influence of part-of-speech information on lexical processes
    Melinger, Alissa
    Koenig, Jean-Pierre
    JOURNAL OF MEMORY AND LANGUAGE, 2007, 56 (04) : 472 - 489
  • [2] Justifying part-of-speech assignments for Mandarin gei
    Her, One-Soon
    LINGUA, 2006, 116 (08) : 1274 - 1302
  • [3] Part-of-speech tagging using genetic algorithms
    Department of Computer Science and Engineering, Lovely Professional University, Jalandhar
    Punjab, India
    Int. J. Simul. Syst. Sci. Technol., 6 (11.1-11.7): : 11.1 - 11.7
  • [4] Question Type Classification Using a Part-of-Speech Hierarchy
    Khoury, Richard
    AUTONOMOUS AND INTELLIGENT SYSTEMS, 2011, 6752 : 212 - 221
  • [5] PosCap: Boosting Video Captioning with Part-of-Speech Guidance
    Xiao, Jingfu
    Chen, Zhiliang
    Jiang, Wenhui
    Fang, Yuming
    Shen, Fei
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 430 - 444
  • [6] Automatic Machine Translation Evaluation with Part-of-Speech Information
    Han, Aaron L. -F.
    Wong, Derek F.
    Chao, Lidia S.
    He, Liangye
    TEXT, SPEECH, AND DIALOGUE, TSD 2013, 2013, 8082 : 121 - 128
  • [7] Training MEMM with PSO: A tool for part-of-speech tagging
    La, L. (lalei1984@yahoo.com.cn), 1600, Academy Publisher (07): : 2511 - 2517
  • [8] Part-of-Speech Tagger for Malay Social Media Texts
    Ariffin, Siti Noor Allia Noor
    Tiun, Sabrina
    GEMA ONLINE JOURNAL OF LANGUAGE STUDIES, 2018, 18 (04): : 124 - 142
  • [9] Adaptive Latency for Part-of-Speech Tagging in Incremental Text-to-Speech Synthesis
    Pouget, Mael
    Nahorna, Olha
    Hueber, Thomas
    Bailly, Gerard
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2846 - 2850
  • [10] Recurrent Neural Network Language Model with Part-of-speech for Mandarin Speech Recognition
    Gong, Caixia
    Li, Xiangang
    Wu, Xihong
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 459 - 463