论文标题
部分可观测时空混沌系统的无模型预测
LWSIS: LiDAR-guided Weakly Supervised Instance Segmentation for Autonomous Driving
论文作者
论文摘要
图像实例细分是自动驾驶中的一个基本研究主题,这对于场景的理解和道路安全至关重要。基于先进的学习方法通常依赖于代价高昂的2D面具注释来培训。在本文中,我们提出了一个更为巧妙的框架,激光雷达引导弱监督的实例细分(LWSIS),该框架利用了现成的3D数据,即点云以及3D框以及3D框,作为培训2D图像实例分割模型的自然弱监管。我们的LWSI不仅在训练过程中利用了多模式数据中的互补信息,而且还大大降低了密集的2D口罩的注释成本。详细说明,LWSIS由两个关键模块,点标签分配(PLA)和基于图的一致性正则化(GCR)组成。前一个模块旨在自动将3D点云分配为2D点标签,而后者通过执行多模式数据的几何形状和外观一致性进一步完善了预测。此外,我们对名为Nuinsseg的Nuscenes进行了次要实例分割注释,以鼓励对多模式感知任务的进一步研究。关于Nuinsseg的广泛实验以及大规模的Waymo,表明LWSI可以通过仅在培训过程中涉及3D数据来实质上改善现有的弱监督分割模型。此外,LWSI也可以将其合并到3D对象检测器中,例如免费提高3D检测性能。代码和数据集可在https://github.com/serenos/lwsis上找到。
Image instance segmentation is a fundamental research topic in autonomous driving, which is crucial for scene understanding and road safety. Advanced learning-based approaches often rely on the costly 2D mask annotations for training. In this paper, we present a more artful framework, LiDAR-guided Weakly Supervised Instance Segmentation (LWSIS), which leverages the off-the-shelf 3D data, i.e., Point Cloud, together with the 3D boxes, as natural weak supervisions for training the 2D image instance segmentation models. Our LWSIS not only exploits the complementary information in multimodal data during training, but also significantly reduces the annotation cost of the dense 2D masks. In detail, LWSIS consists of two crucial modules, Point Label Assignment (PLA) and Graph-based Consistency Regularization (GCR). The former module aims to automatically assign the 3D point cloud as 2D point-wise labels, while the latter further refines the predictions by enforcing geometry and appearance consistency of the multimodal data. Moreover, we conduct a secondary instance segmentation annotation on the nuScenes, named nuInsSeg, to encourage further research on multimodal perception tasks. Extensive experiments on the nuInsSeg, as well as the large-scale Waymo, show that LWSIS can substantially improve existing weakly supervised segmentation models by only involving 3D data during training. Additionally, LWSIS can also be incorporated into 3D object detectors like PointPainting to boost the 3D detection performance for free. The code and dataset are available at https://github.com/Serenos/LWSIS.