神经室：室内现场重建的几何受限的神经隐式表面

论文标题

神经室：室内现场重建的几何受限的神经隐式表面

NeuralRoom: Geometry-Constrained Neural Implicit Surfaces for Indoor Scene Reconstruction

论文作者

Wang, Yusen, Li, Zongcheng, Jiang, Yu, Zhou, Kaixuan, Cao, Tuo, Fu, Yanping, Xiao, Chunxia

论文摘要

我们提出了一种新型的神经表面重建方法，称为神经室，用于直接从一组2D图像中重建房间大小的室内场景。最近，由于其高质量的结果和简单性，隐式神经表示已成为从多视图像重建表面的一种有希望的方法。但是，隐式神经表示通常不能很好地重建室内场景，因为它们遭受了严重的形状误解歧义。我们假设室内场景由质地丰富和无纹理的区域组成。在质地丰富的区域中，多视立体声可以获得准确的结果。在平面区域，正常估计网络通常获得良好的正常估计。基于上述观察结果，我们通过可靠的几何学先验来减少隐式神经表面的空间变化范围，从而减轻形状范围的歧义。具体而言，我们使用多视立体声结果来限制神经房间优化空间，然后使用可靠的几何先验来指导神经室训练。然后，神经室将产生神经场景表示形式，可以渲染与输入训练图像一致的图像。此外，我们提出了一种称为扰动 - 残基限制的平滑方法，以提高平坦区域的准确性和完整性，该方法假设局部表面中的采样点应具有与观察中心相同的正常距离和相似的距离。扫描仪数据集的实验表明，我们的方法可以重建室内场景的无纹理区域，同时保持细节的准确性。我们还将神经室应用于更高级的多视频重建算法，并显着提高其重建质量。

We present a novel neural surface reconstruction method called NeuralRoom for reconstructing room-sized indoor scenes directly from a set of 2D images. Recently, implicit neural representations have become a promising way to reconstruct surfaces from multiview images due to their high-quality results and simplicity. However, implicit neural representations usually cannot reconstruct indoor scenes well because they suffer severe shape-radiance ambiguity. We assume that the indoor scene consists of texture-rich and flat texture-less regions. In texture-rich regions, the multiview stereo can obtain accurate results. In the flat area, normal estimation networks usually obtain a good normal estimation. Based on the above observations, we reduce the possible spatial variation range of implicit neural surfaces by reliable geometric priors to alleviate shape-radiance ambiguity. Specifically, we use multiview stereo results to limit the NeuralRoom optimization space and then use reliable geometric priors to guide NeuralRoom training. Then the NeuralRoom would produce a neural scene representation that can render an image consistent with the input training images. In addition, we propose a smoothing method called perturbation-residual restrictions to improve the accuracy and completeness of the flat region, which assumes that the sampling points in a local surface should have the same normal and similar distance to the observation center. Experiments on the ScanNet dataset show that our method can reconstruct the texture-less area of indoor scenes while maintaining the accuracy of detail. We also apply NeuralRoom to more advanced multiview reconstruction algorithms and significantly improve their reconstruction quality.

下载PDF全文

下载文献需遵守相关版权规定

论文标题