论文标题
MaskGroup:3D实例分割的分层分组和掩模
MaskGroup: Hierarchical Point Grouping and Masking for 3D Instance Segmentation
论文作者
论文摘要
本文研究了3D实例细分问题,该问题具有多种现实应用程序,例如机器人技术和增强现实。由于3D对象的周围环境具有很高的复杂性,因此不同对象的分离非常困难。为了解决这个具有挑战性的问题,我们提出了一个新颖的框架,以分组和完善3D实例。实际上,我们首先学习每个点的偏移矢量,然后将其转移到其预测的实例中心。为了更好地分组这些要点,我们提出了一个分层分组算法,以逐步合并集中汇总点。所有点都分为小簇,进一步逐渐接受另一个聚类程序,将其合并为较大的组。这些多尺度组被利用,例如预测,这对于预测具有不同尺度的实例是有益的。此外,开发了一种新型的maskscorenet,以产生这些组的二元点面具,以进一步完善分割结果。在ScannETV2和S3DIS基准上进行的广泛实验证明了该方法的有效性。例如,我们的方法在ScannETV2测试集上具有0.5 IOU阈值,获得了66.4 \%的映射,该阈值比最先进的方法高1.9 \%。
This paper studies the 3D instance segmentation problem, which has a variety of real-world applications such as robotics and augmented reality. Since the surroundings of 3D objects are of high complexity, the separating of different objects is very difficult. To address this challenging problem, we propose a novel framework to group and refine the 3D instances. In practice, we first learn an offset vector for each point and shift it to its predicted instance center. To better group these points, we propose a Hierarchical Point Grouping algorithm to merge the centrally aggregated points progressively. All points are grouped into small clusters, which further gradually undergo another clustering procedure to merge into larger groups. These multi-scale groups are exploited for instance prediction, which is beneficial for predicting instances with different scales. In addition, a novel MaskScoreNet is developed to produce binary point masks of these groups for further refining the segmentation results. Extensive experiments conducted on the ScanNetV2 and S3DIS benchmarks demonstrate the effectiveness of the proposed method. For instance, our approach achieves a 66.4\% mAP with the 0.5 IoU threshold on the ScanNetV2 test set, which is 1.9\% higher than the state-of-the-art method.