Unsupervised Sub-tree Alignment for Tree-to-Tree Translation

被引:7
|
作者
Xiao, Tong [1 ]
Zhu, Jingbo [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Peoples R China
基金
中国博士后科学基金; 美国国家科学基金会;
关键词
D O I
10.1613/jair.4033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article presents a probabilistic sub-tree alignment model and its application to tree-to-tree machine translation. Unlike previous work, we do not resort to surface heuristics or expensive annotated data, but instead derive an unsupervised model to infer the syntactic correspondence between two languages. More importantly, the developed model is syntactically-motivated and does not rely on word alignments. As a by-product, our model outputs a sub-tree alignment matrix encoding a large number of diverse alignments between syntactic structures, from which machine translation systems can efficiently extract translation rules that are often filtered out due to the errors in 1-best alignment. Experimental results show that the proposed approach outperforms three state-of-the-art baseline approaches in both alignment accuracy and grammar quality. When applied to machine translation, our approach yields a +1.0 BLEU improvement and a -0.9 TER reduction on the NIST machine translation evaluation corpora. With tree binarization and fuzzy decoding, it even outperforms a state-of-the-art hierarchical phrase-based system.
引用
收藏
页码:733 / 782
页数:50
相关论文
共 50 条
  • [31] A Mathematical Formula Retrieval Method Using Structure Sub-tree
    Guan, Mingjie
    Tian, Xuedong
    Yang, Fang
    Yang, Songqiang
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION APPLICATIONS (ICCIA 2012), 2012, : 583 - 586
  • [32] Fractal Image Coding Based on Oriented Wavelet Sub-tree
    Jiang Shan
    Shuang Kai
    Sun Li Wei
    ISCSCT 2008: INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 273 - +
  • [34] Sub-tree Swapping Crossover, Allele Diffusion and GP Convergence
    Dignum, Stephen
    Poli, Riccardo
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN X, PROCEEDINGS, 2008, 5199 : 368 - 377
  • [35] Semantic Sub-tree Crossover Operator for Postfix Genetic Programming
    Dabhi, Vipul K.
    Chaudhary, Sanjay
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONFERENCE ON BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS (BIC-TA 2012), VOL 1, 2013, 201 : 391 - +
  • [36] Fast code recommendation via approximate sub-tree matching
    Shao, Yichao
    Huang, Zhiqiu
    Li, Weiwei
    Yu, Yaoshen
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, 23 (08) : 1205 - 1216
  • [37] Head- and relation-driven tree-to-tree translation using phrases in a monolingual corpus - An extension of CBMT
    National Institute of Information and Communications Technology , Keihanna Science City, Japan
    Int. Univers. Commun. Symp., IUCS - Proc., (15-22):
  • [38] A Data Augmentation Method Based on Sub-tree Exchange for Low-Resource Neural Machine Translation
    Chi, Chuncheng
    Li, Fuxue
    Yan, Hong
    Guan, Hui
    Zhao, Zhongchao
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 : 646 - 657
  • [39] Tree-to-tree interactions slow down Himalayan treeline shifts as inferred from tree spatial patterns
    Sigdel, Shalik Ram
    Liang, Eryuan
    Wang, Yafeng
    Dawadi, Binod
    Camarero, Jesus Julio
    JOURNAL OF BIOGEOGRAPHY, 2020, 47 (08) : 1816 - 1826
  • [40] POLLEN TRANSFER IN APPLE ORCHARDS - TREE-TO-TREE OR BEE-TO-BEE
    DEGRANDIHOFFMAN, G
    HOOPINGARNER, R
    BAKER, K
    BEE WORLD, 1984, 65 (03) : 126 - 133