论文标题
调查自由形式理由的好处
Investigating the Benefits of Free-Form Rationales
论文作者
论文摘要
自由形式的理由旨在通过提供可以帮助理解模型决策的背景知识来帮助模型的解释性。在流行数据集(例如COS-E和ECQA)中,提供了众包质量质量的质量质量质量质量,但它们的效用仍然不足。我们提出了人类研究,该研究表明,ECQA理由确实提供了其他背景信息以了解决策,而超过88%的COS-E理由则没有。受这一发现的启发,我们问:自由形式的理由模型提供的其他上下文是否可以与人类用户相似?我们通过改变培训期间的理由数量和质量来研究理由作为其他监督的效用。在控制理由泄漏正确答案同时不提供其他背景知识的情况下,我们发现在训练期间仅纳入5%的理由可以使COS-E的模型性能提高47.22%,而推断期间ECQA的eCQA则可以提高57.14%。此外,我们还表明了理由质量重要:与众包理由相比,T5生成的理由不仅为模型提供了较弱的监督,而且对人类在有助于模型的解释性方面也没有帮助。
Free-form rationales aim to aid model interpretability by supplying the background knowledge that can help understand model decisions. Crowdsourced rationales are provided for commonsense QA instances in popular datasets such as CoS-E and ECQA, but their utility remains under-investigated. We present human studies which show that ECQA rationales indeed provide additional background information to understand a decision, while over 88% of CoS-E rationales do not. Inspired by this finding, we ask: can the additional context provided by free-form rationales benefit models, similar to human users? We investigate the utility of rationales as an additional source of supervision, by varying the quantity and quality of rationales during training. After controlling for instances where rationales leak the correct answer while not providing additional background knowledge, we find that incorporating only 5% of rationales during training can boost model performance by 47.22% for CoS-E and 57.14% for ECQA during inference. Moreover, we also show that rationale quality matters: compared to crowdsourced rationales, T5-generated rationales provide not only weaker supervision to models, but are also not helpful for humans in aiding model interpretability.