从多视图图像结合体素超分辨率和学习隐式表示的多视图图像中详细的3D人体重建

论文标题

从多视图图像结合体素超分辨率和学习隐式表示的多视图图像中详细的3D人体重建

Detailed 3D Human Body Reconstruction from Multi-view Images Combining Voxel Super-Resolution and Learned Implicit Representation

论文作者

Li, Zhongguo, Oskarsson, Magnus, Heyden, Anders

论文摘要

从图像中重建详细的3D人体模型的任务很有趣，但是由于人体的高自由度，在计算机视觉中具有挑战性。为了解决这个问题，我们提出了一种粗到精细的方法，以根据学习隐式表示，从多视图图像中重建一个详细的3D人体。首先，通过学习基于多尺度特征的隐式表示来估算粗3D模型，这些特征是由多视图图像从多级沙漏网络中提取的。然后，以粗分辨率3D模型生成的低分辨率体素网格作为输入，基于隐式表示的体素超分辨率是通过多阶段3D卷积神经网络学习的。最后，可以通过体素超级分辨率产生精致的详细3D人体模型，该模型可以保留细节并减少粗3D模型的错误重建。从隐性表示中受益，我们方法中的训练过程是有效的，并且通过多视图图像产生的详细的3D人体是具有高分辨率几何形状的连续决策边界。此外，基于体素超分辨率的粗到加密方法可以同时消除错误的重建并保留最终重建中的外观细节。在实验中，我们的方法在定量和定性上从具有各种姿势和形状的真实和合成数据集的图像中实现了具有竞争力的3D人体重建。

The task of reconstructing detailed 3D human body models from images is interesting but challenging in computer vision due to the high freedom of human bodies. In order to tackle the problem, we propose a coarse-to-fine method to reconstruct a detailed 3D human body from multi-view images combining voxel super-resolution based on learning the implicit representation. Firstly, the coarse 3D models are estimated by learning an implicit representation based on multi-scale features which are extracted by multi-stage hourglass networks from the multi-view images. Then, taking the low resolution voxel grids which are generated by the coarse 3D models as input, the voxel super-resolution based on an implicit representation is learned through a multi-stage 3D convolutional neural network. Finally, the refined detailed 3D human body models can be produced by the voxel super-resolution which can preserve the details and reduce the false reconstruction of the coarse 3D models. Benefiting from the implicit representation, the training process in our method is memory efficient and the detailed 3D human body produced by our method from multi-view images is the continuous decision boundary with high-resolution geometry. In addition, the coarse-to-fine method based on voxel super-resolution can remove false reconstructions and preserve the appearance details in the final reconstruction, simultaneously. In the experiments, our method quantitatively and qualitatively achieves the competitive 3D human body reconstructions from images with various poses and shapes on both the real and synthetic datasets.

下载PDF全文

下载文献需遵守相关版权规定

论文标题