超越原型：几次分割的划分代理

论文标题

超越原型：几次分割的划分代理

Beyond the Prototype: Divide-and-conquer Proxies for Few-shot Segmentation

论文作者

Lang, Chunbo, Tu, Binfei, Cheng, Gong, Han, Junwei

论文摘要

很少有射击细分旨在仅给出少数标记的样本，旨在细分看不见的级别对象，因此受到了社区的广泛关注。现有方法通常遵循原型学习范式执行元推断，这无法完全利用支持图像掩码对的基础信息，从而导致各种细分失败，例如，不完整的对象，模棱两可的界限，歧义性的边界和分散分心。为此，我们本着分裂和构造的精神提出了一个简单而多才多艺的框架。具体而言，首先在注释的支持图像上实现了一种新颖的自我调查方案，然后将粗分割掩码分为具有不同属性的多个区域。因此，利用有效的掩盖平均合并操作，得出了一系列支持引起的代理，每个代理都在征服上述挑战方面发挥了特定的作用。此外，我们设计了一种独特的并行解码器结构，该结构将代理集成具有相似属性以提高歧视能力。我们所提出的方法，名为Divide and-Conder Querquer代理（DCP），允许开发适当可靠的信息作为“情节”级别的指导，而不仅仅是对象提示本身。对Pascal-5i和Coco-20i的广泛实验表明，DCP优于基于常规原型的方法（平均最高5〜10％），这也建立了新的最新技术。代码可在github.com/chunbolang/dcp上找到。

Few-shot segmentation, which aims to segment unseen-class objects given only a handful of densely labeled samples, has received widespread attention from the community. Existing approaches typically follow the prototype learning paradigm to perform meta-inference, which fails to fully exploit the underlying information from support image-mask pairs, resulting in various segmentation failures, e.g., incomplete objects, ambiguous boundaries, and distractor activation. To this end, we propose a simple yet versatile framework in the spirit of divide-and-conquer. Specifically, a novel self-reasoning scheme is first implemented on the annotated support image, and then the coarse segmentation mask is divided into multiple regions with different properties. Leveraging effective masked average pooling operations, a series of support-induced proxies are thus derived, each playing a specific role in conquering the above challenges. Moreover, we devise a unique parallel decoder structure that integrates proxies with similar attributes to boost the discrimination power. Our proposed approach, named divide-and-conquer proxies (DCP), allows for the development of appropriate and reliable information as a guide at the "episode" level, not just about the object cues themselves. Extensive experiments on PASCAL-5i and COCO-20i demonstrate the superiority of DCP over conventional prototype-based approaches (up to 5~10% on average), which also establishes a new state-of-the-art. Code is available at github.com/chunbolang/DCP.

下载PDF全文

下载文献需遵守相关版权规定

论文标题