Robust rule-based method for automatic break assignment in Russian texts

被引:0
|
作者
Oparin, I
机构
[1] St Petersburg State Univ, Dept Phonet, St Petersburg 199034, Russia
[2] Speech Technol Ctr, St Petersburg 196084, Russia
来源
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2005年 / 3658卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper a new rule-based approach to break assignment for the Russian language is discussed. It is a flexible and robust method of segmentation of texts in Russian in prosodic units. We implemented it in the recent "Orator", text-to-speech (TTS) system. The model was developed to use for the inflective languages as an alternative both for statistic and for strict rule-based algorithms. It is designed in such a way that all potentially tunable context dependencies are brought up to the interface grammar and can be easily modified by linguists. The algorithm we developed performs well on different kinds of texts due to this simple and intuitive grammar built upon an elaborate mechanism of morpho-grammatical analysis. Juncture correct rate varies between more than 98% for simple literary texts and 85% for raw transcripts of spontaneous speech.
引用
收藏
页码:356 / 363
页数:8
相关论文
共 50 条
  • [1] Automatic Robust Rule-Based Phonetization of Standard Arabic
    Sindran, Fadi
    Mualla, Firas
    Bobzin, Katharina
    Noeth, Elmar
    TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 442 - 451
  • [2] A rule-based system for automatic assignment of technicians to service faults
    Lazarov, A
    Shoval, P
    DECISION SUPPORT SYSTEMS, 2002, 32 (04) : 343 - 360
  • [3] A rule-based automatic sleep staging method
    Liang, Sheng-Fu
    Kuo, Chih-En
    Hu, Yu-Han
    Cheng, Yu-Shian
    2011 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2011, : 6067 - 6070
  • [4] A rule-based automatic sleep staging method
    Liang, Sheng-Fu
    Kuo, Chin-En
    Hu, Yu-Han
    Cheng, Yu-Shian
    JOURNAL OF NEUROSCIENCE METHODS, 2012, 205 (01) : 169 - 176
  • [5] Automatic normalization of short texts by combining statistical and rule-based techniques
    Marta R. Costa-jussà
    Rafael E. Banchs
    Language Resources and Evaluation, 2013, 47 : 179 - 193
  • [6] Automatic normalization of short texts by combining statistical and rule-based techniques
    Costa-jussa, Marta R.
    Banchs, Rafael E.
    LANGUAGE RESOURCES AND EVALUATION, 2013, 47 (01) : 179 - 193
  • [7] Research of the Rule-based Automatic Billing Method of Goods
    Ma, Jingjing
    Lv, Xiyan
    2012 INTERNATIONAL CONFERENCE ON APPLIED INFORMATICS AND COMMUNICATION (ICAIC 2012), 2013, : 204 - 209
  • [8] Robust Rule-Based Method for Human Activity Recognition
    Sugimoto, Masafumi
    Zin, Thi Thi
    Toriu, Takashi
    Nakajima, Shigeyoshi
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2011, 11 (04): : 37 - 43
  • [9] A rule-based system for automatic de-identification of medical narrative texts
    Jaćimović, Jelena
    Krstev, Cvetana
    Jelovac, Drago
    Informatica (Slovenia), 2015, 39 (01): : 45 - 53
  • [10] A Rule-Based System for Automatic De-identification of Medical Narrative Texts
    Jacimovic, Jelena
    Krstev, Cvetana
    Jelovac, Drago
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2015, 39 (01): : 43 - 51