深度灌注：可激发，摄像头和雷达的强大和模块化3D对象检测器

论文标题

深度灌注：可激发，摄像头和雷达的强大和模块化3D对象检测器

DeepFusion: A Robust and Modular 3D Object Detector for Lidars, Cameras and Radars

论文作者

Drews, Florian, Feng, Di, Faion, Florian, Rosenbaum, Lars, Ulrich, Michael, Gläser, Claudius

论文摘要

我们提出了DeepFusion，这是一种模块化的多模式结构，可在不同组合中以3D对象检测为融合激光雷达，相机和雷达。专门的功能提取器可以利用每种方式，并且可以轻松交换，从而使该方法变得简单而灵活。提取的特征被转化为鸟眼视图，作为融合的共同表示。在特征空间中融合方式之前，先进行空间和语义对齐方式。最后，检测头利用丰富的多模式特征，以提高3D检测性能。激光摄像头，激光摄像头雷达和摄像头融合的实验结果显示了我们融合方法的灵活性和有效性。在此过程中，我们研究了高达225米的遥远汽车检测的很大程度上未开发的任务，显示了激光摄像机融合的好处。此外，我们研究了3D对象检测的LIDAR点所需的密度，并以鲁棒性抵抗不利天气条件的示例说明了含义。此外，对我们的摄像头融合的消融研究突出了准确深度估计的重要性。

We propose DeepFusion, a modular multi-modal architecture to fuse lidars, cameras and radars in different combinations for 3D object detection. Specialized feature extractors take advantage of each modality and can be exchanged easily, making the approach simple and flexible. Extracted features are transformed into bird's-eye-view as a common representation for fusion. Spatial and semantic alignment is performed prior to fusing modalities in the feature space. Finally, a detection head exploits rich multi-modal features for improved 3D detection performance. Experimental results for lidar-camera, lidar-camera-radar and camera-radar fusion show the flexibility and effectiveness of our fusion approach. In the process, we study the largely unexplored task of faraway car detection up to 225 meters, showing the benefits of our lidar-camera fusion. Furthermore, we investigate the required density of lidar points for 3D object detection and illustrate implications at the example of robustness against adverse weather conditions. Moreover, ablation studies on our camera-radar fusion highlight the importance of accurate depth estimation.

下载PDF全文

下载文献需遵守相关版权规定

论文标题