论文标题
通过生成证据融合和往返预测来回答模棱两可的问题
Answering Ambiguous Questions through Generative Evidence Fusion and Round-Trip Prediction
论文作者
论文摘要
在开放域的问题回答中,问题很可能是模棱两可的,因为用户在制定它们时可能不知道相关主题的范围。因此,系统需要找到对问题的可能解释,并预测一个或多个合理的答案。当找到多个合理的答案时,系统应为每个答案重写问题以解决歧义。在本文中,我们提出了一个模型,该模型汇总并结合了来自多个段落的证据,以适应一个答案或一组问答对,以解决模棱两可的问题。此外,我们提出了一种新颖的往返预测方法,以迭代生成我们的模型在第一张通过中无法找到的其他解释,然后验证和过滤不正确的问题解答对,以达到最终的不利的输出。我们的模型名为《 Repuel》,在Ambigqa数据集上实现了新的最新性能,并在NQ-OPEN和TRIVIAQA上显示了竞争性能。拟议的往返预测是一种模型不足的一般方法,用于回答模棱两可的开放域问题,可改善我们的加油和几种基线模型。我们在https://github.com/amzn/refuel-open-domain-qa上发布了模型和实验的源代码。
In open-domain question answering, questions are highly likely to be ambiguous because users may not know the scope of relevant topics when formulating them. Therefore, a system needs to find possible interpretations of the question, and predict one or multiple plausible answers. When multiple plausible answers are found, the system should rewrite the question for each answer to resolve the ambiguity. In this paper, we present a model that aggregates and combines evidence from multiple passages to adaptively predict a single answer or a set of question-answer pairs for ambiguous questions. In addition, we propose a novel round-trip prediction approach to iteratively generate additional interpretations that our model fails to find in the first pass, and then verify and filter out the incorrect question-answer pairs to arrive at the final disambiguated output. Our model, named Refuel, achieves a new state-of-the-art performance on the AmbigQA dataset, and shows competitive performance on NQ-Open and TriviaQA. The proposed round-trip prediction is a model-agnostic general approach for answering ambiguous open-domain questions, which improves our Refuel as well as several baseline models. We release source code for our models and experiments at https://github.com/amzn/refuel-open-domain-qa.