Construction and Analysis of Word-level Time-aligned Simultaneous Interpretation Corpus

被引:0
|
作者
Ono, Takahiro [1 ]
Tohyama, Hitomi [1 ]
Matsubara, Shigeki [1 ]
机构
[1] Nagoya Univ, Grad Sch Informat Sci, Chikusa Ku, Nagoya, Aichi 4648601, Japan
关键词
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
In this paper, quantitative analyses of the delay in Japanese-to-English (J-E) and English-to-Japanese (E-J) interpretations are described. The Simultaneous Interpretation Database of Nagoya University (SIDB) was used for the analyses. Beginning time and end time of each word were provided to the corpus using HMM-based phoneme segmentation, and the time lag between the corresponding words was calculated as the word-level delay. Word-level delay was calculated for 3,722 pairs and 4,932 pairs of words for J-E and E-J interpretations, respectively. The analyses revealed that J-E interpretation have much larger delay than E-J interpretation and that the difference of word order between Japanese and English affect the degree of delay.
引用
收藏
页码:3383 / 3387
页数:5
相关论文
共 50 条
  • [1] Listenership Behaviours in Intercultural Encounters: A Time-aligned Multimodal Corpus Analysis
    Ma, Wen
    Wu, Yijin
    DISCOURSE & SOCIETY, 2016, 27 (01) : 122 - 123
  • [2] Time-aligned SVD analysis for speaker identification
    Clemins, P
    Ewalt, H
    Johnson, M
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4160 - 4160
  • [3] The Maaloula Aramaic Speech Corpus (MASC): From Printed Material to a Lemmatized and Time-Aligned Corpus
    Eid, Ghattas
    Seyffarth, Esther
    Plag, Ingo
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6513 - 6520
  • [4] Enhanced Simultaneous Machine Translation with Word-level Policies
    Kim, Kang
    Cho, Hankyu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 15616 - 15634
  • [6] Bayesian time-aligned factor analysis of paired multivariate time series
    Roy, Arkaprava
    Borg, Jana Schaich
    Dunson, David B.
    Journal of Machine Learning Research, 2021, 22
  • [7] Word-Level Contextual Sentiment Analysis with Interpretability
    Ito, Tomoki
    Tsubouchi, Kota
    Sakaji, Hiroki
    Yamashita, Tatsuo
    Izumi, Kiyoshi
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 4231 - 4238
  • [8] Word-level Dependency-structure Annotation to Corpus of Spontaneous Japanese and Its Application
    Uchimoto, Kiyotaka
    Den, Yasuharu
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 3118 - 3122
  • [9] A Corpus of Word-Level Offline Handwritten Numeral Images from Official Indic Scripts
    Obaidullah, Sk Md
    Halder, Chayan
    Das, Nibaran
    Roy, Kaushik
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGIES, IC3T 2015, VOL 1, 2016, 379 : 703 - 711
  • [10] Building a Time-Aligned Cross-Linguistic Reference Corpus from Language Documentation Data (DoReCo)
    Paschen, Ludger
    Delafontaine, Francois
    Draxler, Christoph
    Fuchs, Susanne
    Stave, Matthew
    Seifart, Frank
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 2657 - 2666