Morphological Analysis Based Part-of-Speech Tagging for Uyghur Speech Synthesis

Cited by: 0
Authors
Mamateli, Guljamal [1]
Rozi, Askar [2]
Ali, Gulnar [1]
Hamdulla, Askar [1]
Affiliations
[1] Xinjiang Univ, Inst Informat Sci & Engn, Urumqi 830046, Peoples R China
[2] Xinjiang Univ, Inst Math & Syst Sci, Urumqi 830046, Peoples R China
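Source
KNOWLEDGE ENGINEERING AND MANAGEMENT | 2011 / Vol. 123
Funding
National Natural Science Foundation of China
Keywords
Uyghur language; part-of-speech tagging; bi-gram language model; hidden Markov model
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405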
Abstract
The accuracy of part-of-speech (POS) tagging is critical to downstream sub-tasks in the front-end text analysis model of a text-to-speech system. Uyghur is an agglutinative language in which large numbers of words are formed by attaching suffixes to a stem (or root). Because Uyghur allows an unlimited number of newly formed and derived words, conventional Uyghur POS tagging methods, which train on and predict the POS of whole words directly, require a large tag set and frequently encounter out-of-vocabulary words. To address this problem, this paper proposes training on the POS of stems and predicting the POS of a word mainly from its stem. A bi-gram language model is used to segment each word at the stem-affix boundary, and a hidden Markov model is used to train and predict the POS of stems. Finally, a rule-based adjustment step corrects the POS of a word when attaching a suffix to the stem changes it. Experimental results show that the proposed method clearly reduces the POS tagging error rate compared with the conventional method.
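The three stages named in the abstract (bi-gram segmentation, hidden Markov model tagging of stems, rule-based adjustment) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the romanized toy forms, the tag set `N`/`V`, all probabilities, the `FLOOR` smoothing constant, and the function names `segment`, `viterbi`, and `tag` are assumptions made up for this example.

```python
import math

FLOOR = 1e-6  # assumed probability for unseen events (hypothetical smoothing)

# Hypothetical morpheme bi-gram probabilities P(next | previous).
BIGRAM = {
    ("<s>", "kitab"): 0.5, ("<s>", "oqu"): 0.5,
    ("kitab", "lar"): 0.4, ("kitab", "</s>"): 0.6,
    ("oqu", "ghuchi"): 0.3, ("oqu", "</s>"): 0.7,
}

# Hypothetical HMM parameters over stem tags.
TAGS = ["N", "V"]
START = {"N": 0.6, "V": 0.4}
TRANS = {"N": {"N": 0.3, "V": 0.7}, "V": {"N": 0.6, "V": 0.4}}
EMIT = {"N": {"kitab": 0.9, "oqu": 0.1}, "V": {"kitab": 0.1, "oqu": 0.9}}

# Hypothetical adjustment rules: (stem tag, suffix) -> word tag.
RULES = {("V", "ghuchi"): "N"}


def segment(word):
    """Pick the (stem, suffix) split with the highest bi-gram log score."""
    def score(stem, suffix):
        nxt = suffix if suffix else "</s>"
        return (math.log(BIGRAM.get(("<s>", stem), FLOOR))
                + math.log(BIGRAM.get((stem, nxt), FLOOR)))
    splits = [(word[:i], word[i:]) for i in range(1, len(word) + 1)]
    return max(splits, key=lambda pair: score(*pair))


def viterbi(stems):
    """Return the most likely tag sequence for a list of stems (log space)."""
    col = {t: math.log(START[t]) + math.log(EMIT[t].get(stems[0], FLOOR))
           for t in TAGS}
    backpointers = []
    for stem in stems[1:]:
        new_col, ptr = {}, {}
        for t in TAGS:
            best = max(TAGS, key=lambda p: col[p] + math.log(TRANS[p][t]))
            new_col[t] = (col[best] + math.log(TRANS[best][t])
                          + math.log(EMIT[t].get(stem, FLOOR)))
            ptr[t] = best
        col = new_col
        backpointers.append(ptr)
    path = [max(TAGS, key=lambda t: col[t])]
    for ptr in reversed(backpointers):
        path.append(ptr[path[-1]])
    return path[::-1]


def tag(words):
    """Segment each word, tag the stems, then apply suffix-based rules."""
    segs = [segment(w) for w in words]
    stem_tags = viterbi([stem for stem, _ in segs])
    return [RULES.get((t, suffix), t) for (_, suffix), t in zip(segs, stem_tags)]


if __name__ == "__main__":
    print(tag(["oqughuchi", "kitab", "oqu"]))
```

In this toy setup the HMM only needs emission entries for two stems, while the suffix rule table handles the case where a derivational suffix changes the category of the word. That mirrors the abstract's motivation of keeping the trained vocabulary at the stem level.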
Pages: 389 / +
Page count: 2