通过激活和显着图来解释基于BERT的文本相似性

论文标题

通过激活和显着图来解释基于BERT的文本相似性

Interpreting BERT-based Text Similarity via Activation and Saliency Maps

论文作者

Malkiel, Itzik, Ginzburg, Dvir, Barkan, Oren, Caciularu, Avi, Weill, Jonathan, Koenigstein, Noam

论文摘要

最近，人们对基于变压器模型产生有意义的文本嵌入的能力越来越兴趣，并具有多种应用程序，例如文本相似性。尽管该领域取得了重大进展，但相似性预测的解释仍然具有挑战性，尤其是在无监督的环境中。在这项工作中，我们提出了一种无监督的技术，用于解释预先训练的BERT模型推断出的段落相似性。通过查看一对段落，我们的技术确定了决定每个段落的语义的重要单词，在这两个段落中的单词之间匹配，并检索解释两者之间相似性的最重要对。该方法已通过广泛的人类评估进行了评估，并在包含长期复杂段落的数据集中证明了该方法，已显示出巨大的希望，提供了与人类看法更好相关的准确解释。

Recently, there has been growing interest in the ability of Transformer-based models to produce meaningful embeddings of text with several applications, such as text similarity. Despite significant progress in the field, the explanations for similarity predictions remain challenging, especially in unsupervised settings. In this work, we present an unsupervised technique for explaining paragraph similarities inferred by pre-trained BERT models. By looking at a pair of paragraphs, our technique identifies important words that dictate each paragraph's semantics, matches between the words in both paragraphs, and retrieves the most important pairs that explain the similarity between the two. The method, which has been assessed by extensive human evaluations and demonstrated on datasets comprising long and complex paragraphs, has shown great promise, providing accurate interpretations that correlate better with human perceptions.

下载PDF全文

下载文献需遵守相关版权规定

论文标题