论文标题
通过逻辑推理自然语言解释的零射击分类
Zero-Shot Classification by Logical Reasoning on Natural Language Explanations
论文作者
论文摘要
人类可以通过推理其语言解释来对未见类别的数据进行分类。这种能力是由于语言的组成性质:我们可以结合以前看到的属性来描述新类别。例如,我们可能会将鼠尾草的刺耳描述为“它的直毛,黄色的眼睛和长尾巴”,以便其他人可以利用他们对属性的知识“纤细的直角相对较短的钞票”,“黄眼”和“长尾巴”来识别鼠尾草刺痛器。受到这一观察的启发,在这项工作中,我们通过逻辑解析和推理自然语言解释来解决零拍的任务。为此,我们提出了框架的框架(通过解释中的逻辑推理进行分类)。虽然以前的方法通常将文本信息视为隐式特征,但Clore将解释解释为逻辑结构,然后明确地沿着输入上的Thess结构进行明确原因以产生分类分数。基于解释的零击分类基准的实验结果表明,Clore优于基准,我们进一步表明,这主要来自需要更逻辑推理的任务分数。我们还证明,我们的框架可以扩展到视觉模态上的零击分类。除了分类的决策外,Clore可以提供逻辑解析和推理过程,作为明确的理由形式。通过经验分析,我们证明,与基础线相比,岩石也受到语言偏见的影响。
Humans can classify data of an unseen category by reasoning on its language explanations. This ability is owing to the compositional nature of language: we can combine previously seen attributes to describe the new category. For example, we might describe a sage thrasher as "it has a slim straight relatively short bill, yellow eyes and a long tail", so that others can use their knowledge of attributes "slim straight relatively short bill", "yellow eyes" and "long tail" to recognize a sage thrasher. Inspired by this observation, in this work we tackle zero-shot classification task by logically parsing and reasoning on natural language expla-nations. To this end, we propose the framework CLORE (Classification by LOgical Reasoning on Explanations). While previous methods usually regard textual information as implicit features, CLORE parses explanations into logical structures and then explicitly reasons along thess structures on the input to produce a classification score. Experimental results on explanation-based zero-shot classification benchmarks demonstrate that CLORE is superior to baselines, which we further show mainly comes from higher scores on tasks requiring more logical reasoning. We also demonstrate that our framework can be extended to zero-shot classification on visual modality. Alongside classification decisions, CLORE can provide the logical parsing and reasoning process as a clear form of rationale. Through empirical analysis we demonstrate that CLORE is also less affected by linguistic biases than baselines.