论文标题
可解释的研究复制预测通过各种上下文一致性句子掩盖
Interpretable Research Replication Prediction via Variational Contextual Consistency Sentence Masking
论文作者
论文摘要
研究复制预测(RRP)是预测是否可以复制已发表的研究结果的任务。为RRP构建可解释的神经文本分类器,促进了人们对为什么将研究论文预测为可复制或不可复制的理解,因此使其现实世界应用更加可靠和值得信赖。但是,先前关于模型解释的作品主要集中于在单词/短语级别上改善模型的解释性,这对于RRP的长期研究论文尤其不足。此外,现有方法无法利用大型未标记的数据集进一步提高模型解释性。为了解决这些局限性,我们旨在建立一个可解释的神经模型,该模型可以提供句子级别的解释并采用弱监督的方法来进一步利用大量未标记的数据集,以提高可解释性,除了改善预测性能,因为现有工作已经完成。在这项工作中,我们建议使用标记和未标记的数据集在分类器中的上下文中自动提取基于分类器中上下文的键句,以自动提取关键句子。我们在RRP上的实验以及欧洲人权公约(ECHR)数据集的结果表明,VCCSM能够改善使用该领域的长期文档分类任务的模型可解释性,而不是扰动曲线和事后准确性作为评估指标。
Research Replication Prediction (RRP) is the task of predicting whether a published research result can be replicated or not. Building an interpretable neural text classifier for RRP promotes the understanding of why a research paper is predicted as replicable or non-replicable and therefore makes its real-world application more reliable and trustworthy. However, the prior works on model interpretation mainly focused on improving the model interpretability at the word/phrase level, which are insufficient especially for long research papers in RRP. Furthermore, the existing methods cannot utilize a large size of unlabeled dataset to further improve the model interpretability. To address these limitations, we aim to build an interpretable neural model which can provide sentence-level explanations and apply weakly supervised approach to further leverage the large corpus of unlabeled datasets to boost the interpretability in addition to improving prediction performance as existing works have done. In this work, we propose the Variational Contextual Consistency Sentence Masking (VCCSM) method to automatically extract key sentences based on the context in the classifier, using both labeled and unlabeled datasets. Results of our experiments on RRP along with European Convention of Human Rights (ECHR) datasets demonstrate that VCCSM is able to improve the model interpretability for the long document classification tasks using the area over the perturbation curve and post-hoc accuracy as evaluation metrics.