与指针网络的不连续分析

论文标题

与指针网络的不连续分析

Discontinuous Constituent Parsing with Pointer Networks

论文作者

Fernández-González, Daniel, Gómez-Rodríguez, Carlos

论文摘要

计算语言学和NLP中使用的最复杂的句法表示之一是不连续的树木，对于代表德语等语言的所有语法现象至关重要。依赖性解析的最新进展表明，指针网络在句子中的单词之间有效解析句法关系方面表现出色。这种序列到序列模型在建造非目标依赖树方面具有出色的精确度，但是在更艰巨的任务上尚未证明其潜力。我们提出了一种新型的神经网络体系结构，该架构通过指针网络能够生成迄今为止最准确的不连续构成表示，即使无需言论部分标记信息。为此，我们在内部将不连续的组成结构模拟为增强的非注射性依赖性结构。所提出的方法在两个广泛使用的NEGRA和Tiger基准上实现了最先进的结果，从而超过了以前的工作。

One of the most complex syntactic representations used in computational linguistics and NLP are discontinuous constituent trees, crucial for representing all grammatical phenomena of languages such as German. Recent advances in dependency parsing have shown that Pointer Networks excel in efficiently parsing syntactic relations between words in a sentence. This kind of sequence-to-sequence models achieve outstanding accuracies in building non-projective dependency trees, but its potential has not been proved yet on a more difficult task. We propose a novel neural network architecture that, by means of Pointer Networks, is able to generate the most accurate discontinuous constituent representations to date, even without the need of Part-of-Speech tagging information. To do so, we internally model discontinuous constituent structures as augmented non-projective dependency structures. The proposed approach achieves state-of-the-art results on the two widely-used NEGRA and TIGER benchmarks, outperforming previous work by a wide margin.

下载PDF全文

下载文献需遵守相关版权规定

论文标题