论文标题
超越原型:几次分割的划分代理
Beyond the Prototype: Divide-and-conquer Proxies for Few-shot Segmentation
论文作者
论文摘要
很少有射击细分旨在仅给出少数标记的样本,旨在细分看不见的级别对象,因此受到了社区的广泛关注。现有方法通常遵循原型学习范式执行元推断,这无法完全利用支持图像掩码对的基础信息,从而导致各种细分失败,例如,不完整的对象,模棱两可的界限,歧义性的边界和分散分心。为此,我们本着分裂和构造的精神提出了一个简单而多才多艺的框架。具体而言,首先在注释的支持图像上实现了一种新颖的自我调查方案,然后将粗分割掩码分为具有不同属性的多个区域。因此,利用有效的掩盖平均合并操作,得出了一系列支持引起的代理,每个代理都在征服上述挑战方面发挥了特定的作用。此外,我们设计了一种独特的并行解码器结构,该结构将代理集成具有相似属性以提高歧视能力。我们所提出的方法,名为Divide and-Conder Querquer代理(DCP),允许开发适当可靠的信息作为“情节”级别的指导,而不仅仅是对象提示本身。对Pascal-5i和Coco-20i的广泛实验表明,DCP优于基于常规原型的方法(平均最高5〜10%),这也建立了新的最新技术。代码可在github.com/chunbolang/dcp上找到。
Few-shot segmentation, which aims to segment unseen-class objects given only a handful of densely labeled samples, has received widespread attention from the community. Existing approaches typically follow the prototype learning paradigm to perform meta-inference, which fails to fully exploit the underlying information from support image-mask pairs, resulting in various segmentation failures, e.g., incomplete objects, ambiguous boundaries, and distractor activation. To this end, we propose a simple yet versatile framework in the spirit of divide-and-conquer. Specifically, a novel self-reasoning scheme is first implemented on the annotated support image, and then the coarse segmentation mask is divided into multiple regions with different properties. Leveraging effective masked average pooling operations, a series of support-induced proxies are thus derived, each playing a specific role in conquering the above challenges. Moreover, we devise a unique parallel decoder structure that integrates proxies with similar attributes to boost the discrimination power. Our proposed approach, named divide-and-conquer proxies (DCP), allows for the development of appropriate and reliable information as a guide at the "episode" level, not just about the object cues themselves. Extensive experiments on PASCAL-5i and COCO-20i demonstrate the superiority of DCP over conventional prototype-based approaches (up to 5~10% on average), which also establishes a new state-of-the-art. Code is available at github.com/chunbolang/DCP.