论文标题
从签名到口语的机器翻译:艺术和挑战的状态
Machine Translation from Signed to Spoken Languages: State of the Art and Challenges
论文作者
论文摘要
从签名到口语的自动翻译是一个跨学科的研究领域,位于计算机视觉,机器翻译和语言学的交集。然而,该领域的研究主要由计算机科学家孤立地进行。随着该领域变得越来越流行 - 过去三年来已经发表了有关手语翻译主题的大多数科学论文 - 我们提供了最新技术的概述以及不同相关学科的一些必需背景。我们对手语语言学和机器翻译进行了高级介绍,以说明自动手语翻译的要求。我们提出了系统的文献综述,以说明域中的最新技术,然后回溯到要求,为将来的研究提出了一些挑战。我们发现,在口头语言机器翻译研究的肩膀上已取得了重大进展。但是,当前的方法通常不会是语言动机的,也没有适应符号语言的不同输入方式。我们探讨了与手语数据的表示,数据集的收集,对跨学科研究的需求以及超越研究的需求以及向应用程序迈进的挑战。根据我们的发现,我们主张跨学科研究,并基于对符号语言的语言分析的未来研究。此外,在用例识别中包含聋人和听力最终用户,数据收集和评估在创建有用的手语翻译模型中至关重要。我们建议迭代,人类在循环,手语翻译模型的设计和开发。
Automatic translation from signed to spoken languages is an interdisciplinary research domain, lying on the intersection of computer vision, machine translation and linguistics. Nevertheless, research in this domain is performed mostly by computer scientists in isolation. As the domain is becoming increasingly popular - the majority of scientific papers on the topic of sign language translation have been published in the past three years - we provide an overview of the state of the art as well as some required background in the different related disciplines. We give a high-level introduction to sign language linguistics and machine translation to illustrate the requirements of automatic sign language translation. We present a systematic literature review to illustrate the state of the art in the domain and then, harking back to the requirements, lay out several challenges for future research. We find that significant advances have been made on the shoulders of spoken language machine translation research. However, current approaches are often not linguistically motivated or are not adapted to the different input modality of sign languages. We explore challenges related to the representation of sign language data, the collection of datasets, the need for interdisciplinary research and requirements for moving beyond research, towards applications. Based on our findings, we advocate for interdisciplinary research and to base future research on linguistic analysis of sign languages. Furthermore, the inclusion of deaf and hearing end users of sign language translation applications in use case identification, data collection and evaluation is of the utmost importance in the creation of useful sign language translation models. We recommend iterative, human-in-the-loop, design and development of sign language translation models.