Unsupervised Sub-tree Alignment for Tree-to-Tree Translation

被引:7
|
作者
Xiao, Tong [1 ]
Zhu, Jingbo [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Peoples R China
基金
中国博士后科学基金; 美国国家科学基金会;
关键词
D O I
10.1613/jair.4033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article presents a probabilistic sub-tree alignment model and its application to tree-to-tree machine translation. Unlike previous work, we do not resort to surface heuristics or expensive annotated data, but instead derive an unsupervised model to infer the syntactic correspondence between two languages. More importantly, the developed model is syntactically-motivated and does not rely on word alignments. As a by-product, our model outputs a sub-tree alignment matrix encoding a large number of diverse alignments between syntactic structures, from which machine translation systems can efficiently extract translation rules that are often filtered out due to the errors in 1-best alignment. Experimental results show that the proposed approach outperforms three state-of-the-art baseline approaches in both alignment accuracy and grammar quality. When applied to machine translation, our approach yields a +1.0 BLEU improvement and a -0.9 TER reduction on the NIST machine translation evaluation corpora. With tree binarization and fuzzy decoding, it even outperforms a state-of-the-art hierarchical phrase-based system.
引用
收藏
页码:733 / 782
页数:50
相关论文
共 50 条
  • [41] Optimal sub-tree scheduling for wireless sensor networks with partial coverage
    Adasme, Pablo
    COMPUTER STANDARDS & INTERFACES, 2019, 61 : 20 - 35
  • [42] Deep Learning and Sub-Tree Mining for Document Level Sentiment Classification
    Ngoc Phuong Chau
    Viet Anh Phan
    Minh Le Nguyen
    2016 EIGHTH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2016, : 268 - 273
  • [43] Unsupervised learning of tree alignment models for information extraction
    Zigoris, Philip
    Eads, Damian
    Zhang, Yi
    ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 45 - +
  • [44] Novel sub-tree sharing based multicast states aggregation scheme
    Dong, Ping
    Zhang, Hong-Ke
    Yang, Dong
    Xitong Fangzhen Xuebao / Journal of System Simulation, 2008, 20 (15): : 4168 - 4172
  • [45] Adaptive fractal image compression using wavelet sub-tree coefficients
    El Khamy, SE
    Hamdy, NA
    Shatila, H
    Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, 2003, : 536 - 539
  • [47] Optimal mappings with minimum number of connected components in tree-to-tree comparison problems
    Ferraro, P
    Godin, C
    JOURNAL OF ALGORITHMS, 2003, 48 (02) : 385 - 406
  • [48] Enhancing vulnerability detection via AST decomposition and neural sub-tree encoding
    Tian, Zhenzhou
    Tian, Binhui
    Lv, Jiajun
    Chen, Yanping
    Chen, Lingwei
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [49] Methodology of tree-to-tree variation study of safou lipids (Dacryodes edulis) as biodiversity indicator
    Silou, T.
    Kinkela, T.
    Heron, S.
    Tchapla, A.
    RIVISTA ITALIANA DELLE SOSTANZE GRASSE, 2006, 83 (03): : 129 - 136
  • [50] On Properties of the Minimum Entropy Sub-tree to Compute Lower Bounds on the Partition Function
    Molkaraie, Mehdi
    Pakzad, Payam
    2008 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS, VOLS 1-6, 2008, : 2504 - +