Unsupervised Sub-tree Alignment for Tree-to-Tree Translation

Cited by: 7
Authors
Xiao, Tong [1]
Zhu, Jingbo [1]
Affiliations
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Peoples R China
Funding
China Postdoctoral Science Foundation; U.S. National Science Foundation
DOI
10.1613/jair.4033
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This article presents a probabilistic sub-tree alignment model and its application to tree-to-tree machine translation. Unlike previous work, we do not resort to surface heuristics or expensive annotated data, but instead derive an unsupervised model to infer the syntactic correspondence between two languages. More importantly, the developed model is syntactically motivated and does not rely on word alignments. As a by-product, our model outputs a sub-tree alignment matrix encoding a large number of diverse alignments between syntactic structures, from which machine translation systems can efficiently extract translation rules that are often filtered out because of errors in the 1-best alignment. Experimental results show that the proposed approach outperforms three state-of-the-art baseline approaches in both alignment accuracy and grammar quality. When applied to machine translation, our approach yields a 1.0-point BLEU improvement and a 0.9-point TER reduction on the NIST machine translation evaluation corpora. With tree binarization and fuzzy decoding, it even outperforms a state-of-the-art hierarchical phrase-based system.
Pages: 733-782
Number of pages: 50