论文标题
模拟机器翻译中系统组合的投票
Modeling Voting for System Combination in Machine Translation
论文作者
论文摘要
系统组合是结合不同机器翻译系统的假设以提高翻译性能的重要技术。尽管已经证明了早期的系统组合统计方法在分析假设之间的共识中有效,但由于使用管道的使用,它们遭受了错误传播问题的困扰。尽管最近对多源序列到序列模型的端到端培训来缓解了这个问题,但这些神经模型并未明确分析假设之间的关系,并且无法捕获他们的一致性,因为在假设中对单词的关注是独立计算的,而忽略了该单词在多个假设中可能发生的事实。在这项工作中,我们提出了一种模拟机器翻译系统组合投票的方法。基本思想是在来自不同系统的假设中启用单词,以对代表性的单词进行投票,并应参与生成过程。这可以通过量化每个选民的影响及其对每个候选人的偏好来完成。我们的方法结合了统计和神经方法的优势,因为它不仅可以分析假设之间的关系,还可以进行端到端的培训。实验表明,我们的方法能够更好地利用假设之间的共识,并在中文英语和英国 - 德国机器翻译任务上取得了显着改善。
System combination is an important technique for combining the hypotheses of different machine translation systems to improve translation performance. Although early statistical approaches to system combination have been proven effective in analyzing the consensus between hypotheses, they suffer from the error propagation problem due to the use of pipelines. While this problem has been alleviated by end-to-end training of multi-source sequence-to-sequence models recently, these neural models do not explicitly analyze the relations between hypotheses and fail to capture their agreement because the attention to a word in a hypothesis is calculated independently, ignoring the fact that the word might occur in multiple hypotheses. In this work, we propose an approach to modeling voting for system combination in machine translation. The basic idea is to enable words in hypotheses from different systems to vote on words that are representative and should get involved in the generation process. This can be done by quantifying the influence of each voter and its preference for each candidate. Our approach combines the advantages of statistical and neural methods since it can not only analyze the relations between hypotheses but also allow for end-to-end training. Experiments show that our approach is capable of better taking advantage of the consensus between hypotheses and achieves significant improvements over state-of-the-art baselines on Chinese-English and English-German machine translation tasks.