论文标题
自然语言处理的分类工具
Categorical Tools for Natural Language Processing
论文作者
论文摘要
本文将类别理论与计算语言学之间的翻译发展为自然语言处理的基础。这三章涉及语法,语义和语用学。首先,弦图在形式语法中提供了统一的句法结构模型。其次,函子通过将图表变成逻辑,张量,神经或量子计算来计算语义。第三,可以将结果的功能模型组成,以形成平衡是语言处理任务解决方案的游戏。该框架是作为Discopy的一部分实现的,即使用字符串图计算的Python库。我们描述了分类,语言和计算结构之间的对应关系,并证明了它们在组成自然语言处理中的应用。
This thesis develops the translation between category theory and computational linguistics as a foundation for natural language processing. The three chapters deal with syntax, semantics and pragmatics. First, string diagrams provide a unified model of syntactic structures in formal grammars. Second, functors compute semantics by turning diagrams into logical, tensor, neural or quantum computation. Third, the resulting functorial models can be composed to form games where equilibria are the solutions of language processing tasks. This framework is implemented as part of DisCoPy, the Python library for computing with string diagrams. We describe the correspondence between categorical, linguistic and computational structures, and demonstrate their applications in compositional natural language processing.