论文标题

翻译人员可靠性评估的贝叶斯方法

A Bayesian approach to translators' reliability assessment

论文作者

Miccheli, Marco, Leban, Andrej, Tacchella, Andrea, Zaccaria, Andrea, Mazzilli, Dario, Bratières, Sébastien

论文摘要

翻译质量评估(TQA)是由人类翻译人员进行的一个过程,并被广泛使用,既用于估计(日益使用)机器翻译的性能,又用于在翻译提供商及其客户之间找到一致。虽然翻译学者意识到有一种可靠的方法进行TQA流程的重要性,但似乎有限的文献以定量方法来解决可靠性问题。在这项工作中,我们将TQA视为复杂系统物理学的角度的复杂过程,并从贝叶斯范式中处理可靠性问题。使用翻译质量评估的数据集(以错误注释的形式),完全由专业翻译服务提供商翻译的SRL产生,我们比较了两个贝叶斯模型,这些模型参数涉及TQA过程中涉及的以下功能:翻译困难,涉及翻译的特征,涉及翻译的特征以及评估质量的质量 - 审阅者。我们在无监督的设置中验证模型,并表明即使每次翻译只有一份评论,也可以对翻译人员进行有意义的见解。随后,我们提取了诸如翻译人员的技能和审阅者的严格性以及他们各自角色的一致性之类的信息。使用此情况,我们表明,即使在专家翻译的情况下,审阅者的可靠性也不能被认为是理所当然的:翻译人员的专业知识在审查另一个翻译人员产生的翻译时会引起认知偏见。但是,最多的专家翻译人员的特征是最高水平的一致性,无论是在翻译和评估翻译质量时。

Translation Quality Assessment (TQA) is a process conducted by human translators and is widely used, both for estimating the performance of (increasingly used) Machine Translation, and for finding an agreement between translation providers and their customers. While translation scholars are aware of the importance of having a reliable way to conduct the TQA process, it seems that there is limited literature that tackles the issue of reliability with a quantitative approach. In this work, we consider the TQA as a complex process from the point of view of physics of complex systems and approach the reliability issue from the Bayesian paradigm. Using a dataset of translation quality evaluations (in the form of error annotations), produced entirely by the Professional Translation Service Provider Translated SRL, we compare two Bayesian models that parameterise the following features involved in the TQA process: the translation difficulty, the characteristics of the translators involved in producing the translation, and of those assessing its quality - the reviewers. We validate the models in an unsupervised setting and show that it is possible to get meaningful insights into translators even with just one review per translation; subsequently, we extract information like translators' skills and reviewers' strictness, as well as their consistency in their respective roles. Using this, we show that the reliability of reviewers cannot be taken for granted even in the case of expert translators: a translator's expertise can induce a cognitive bias when reviewing a translation produced by another translator. The most expert translators, however, are characterised by the highest level of consistency, both in translating and in assessing the translation quality.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源