Construction and Analysis of Word-level Time-aligned Simultaneous Interpretation Corpus

Cited by: 0
Authors
Ono, Takahiro [1]
Tohyama, Hitomi [1]
Matsubara, Shigeki [1]
Affiliations
[1] Nagoya Univ, Grad Sch Informat Sci, Chikusa Ku, Nagoya, Aichi 4648601, Japan
Keywords
DOI
Not available
Chinese Library Classification number
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
This paper describes quantitative analyses of the delay in Japanese-to-English (J-E) and English-to-Japanese (E-J) simultaneous interpretation. The analyses used the Simultaneous Interpretation Database of Nagoya University (SIDB). The start and end times of each word were added to the corpus using HMM-based phoneme segmentation, and the time lag between corresponding words was calculated as the word-level delay. Word-level delay was calculated for 3,722 word pairs in J-E interpretation and 4,932 word pairs in E-J interpretation. The analyses revealed that J-E interpretation has a much larger delay than E-J interpretation and that the difference in word order between Japanese and English affects the degree of delay.
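As a rough illustration of the word-level delay described in the abstract, the following Python sketch computes the time lag for aligned word pairs. The abstract does not say whether the lag is measured between word start times or end times, so this sketch assumes start times. The names `TimedWord`, `word_delay`, and `mean_delay`, and the sample words and timings, are hypothetical and are not taken from the paper or from SIDB.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class TimedWord:
    """A word with its start and end time in seconds (hypothetical structure)."""
    text: str
    start: float
    end: float


def word_delay(source: TimedWord, target: TimedWord) -> float:
    """Time lag between a source word and its interpreted counterpart.

    Assumption: the lag is measured between start times; the abstract
    does not specify this detail.
    """
    return target.start - source.start


def mean_delay(pairs: list[tuple[TimedWord, TimedWord]]) -> float:
    """Average word-level delay over aligned (source, target) pairs."""
    return mean(word_delay(s, t) for s, t in pairs)


# Invented sample data, for illustration only.
pairs = [
    (TimedWord("kyou", 0.0, 0.25), TimedWord("today", 1.5, 2.0)),
    (TimedWord("kaigi", 0.5, 1.0), TimedWord("meeting", 2.5, 3.0)),
]

delays = [word_delay(s, t) for s, t in pairs]
average = mean_delay(pairs)
```

Measuring from end times instead would only require changing the subtraction in `word_delay`; which convention the authors used is not stated in the abstract.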
Pages: 3383-3387
Page count: 5