论文标题

增强了反映土耳其语凝结性质的BOUN TREEBANK

Enhancements to the BOUN Treebank Reflecting the Agglutinative Nature of Turkish

论文作者

Marşan, Büşra, Akkurt, Salih Furkan, Şen, Muhammet, Gürbüz, Merve, Güngör, Onur, Özateş, Şaziye Betül, Üsküdarlı, Suzan, Özgür, Arzucan, Güngör, Tunga, Öztürk, Balkız

论文摘要

在这项研究中,我们旨在提供以语言动机的解决方案,以解决缺乏无效词素的代表性,高生产力的衍生过程和土耳其在Bon treebank中的融合形式的问题,而不会与普遍的依赖关系框架不同。 为了解决这些问题,通过将某些引理并在UD框架中使用MISC(其他)选项卡来表示新的注释约定来表示派生。在基于LSTM的依赖性解析器上测试了重新注销的树库的代表性功能,并引入了船工具的更新版本。

In this study, we aim to offer linguistically motivated solutions to resolve the issues of the lack of representation of null morphemes, highly productive derivational processes, and syncretic morphemes of Turkish in the BOUN Treebank without diverging from the Universal Dependencies framework. In order to tackle these issues, new annotation conventions were introduced by splitting certain lemmas and employing the MISC (miscellaneous) tab in the UD framework to denote derivation. Representational capabilities of the re-annotated treebank were tested on a LSTM-based dependency parser and an updated version of the BoAT Tool is introduced.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源