论文标题
“约翰吃5个苹果”!=“约翰吃了一些苹果”:代数单词问题的自我监督释义质量检测
'John ate 5 apples' != 'John ate some apples': Self-Supervised Paraphrase Quality Detection for Algebraic Word Problems
论文作者
论文摘要
本文介绍了代数单词问题评分释义的新任务(AWP),并提出了一种自我监督的方法。在当前的在线教学环境中,释义这些问题对于院士来说有助于产生多种句法的问题以进行评估。它还有助于引起变化,以确保学生已经理解问题,而不仅仅是记住问题或使用不公平的手段来解决问题。当前的最新释义生成模型通常无法有效地解释单词问题,而失去了关键信息(例如数字或单位),这使问题无法解决。在AWP的背景下,有必要进行释义方法,以便对良好的释义者进行培训。因此,我们提出了一种使用新型数据增强的自我监管的解释质量检测方法Paraqd,可以学习潜在表示,以通过广泛的利润将代数问题与贫穷的问题分开的高质量释义。通过广泛的实验,我们证明我们的方法的表现优于现有的最先进的自我监管方法,高达32%,同时也证明了令人印象深刻的零击性能。
This paper introduces the novel task of scoring paraphrases for Algebraic Word Problems (AWP) and presents a self-supervised method for doing so. In the current online pedagogical setting, paraphrasing these problems is helpful for academicians to generate multiple syntactically diverse questions for assessments. It also helps induce variation to ensure that the student has understood the problem instead of just memorizing it or using unfair means to solve it. The current state-of-the-art paraphrase generation models often cannot effectively paraphrase word problems, losing a critical piece of information (such as numbers or units) which renders the question unsolvable. There is a need for paraphrase scoring methods in the context of AWP to enable the training of good paraphrasers. Thus, we propose ParaQD, a self-supervised paraphrase quality detection method using novel data augmentations that can learn latent representations to separate a high-quality paraphrase of an algebraic question from a poor one by a wide margin. Through extensive experimentation, we demonstrate that our method outperforms existing state-of-the-art self-supervised methods by up to 32% while also demonstrating impressive zero-shot performance.