论文标题
基于深度学习的单眼深度估计:概述
Monocular Depth Estimation Based On Deep Learning: An Overview
论文作者
论文摘要
深度信息对于自主系统感知环境和估计自己的状态很重要。传统的深度估计方法,例如运动和立体声视觉匹配的结构,都建立在多个观点的特征对应关系上。同时,预测的深度图很少。从单个图像(单眼深度估计)中推断深度信息是一个问题。随着深度神经网络的快速发展,最近对基于深度学习的单眼深度估计进行了广泛的研究,并在准确性方面取得了有希望的表现。同时,密集的深度图以端到端的方式通过深度神经网络从单个图像中估算出来。为了提高深度估计的准确性,随后提出了各种网络框架,损失功能和培训策略。因此,我们根据本综述的深度学习调查了当前的单眼深度估计方法。最初,我们在基于深度学习的深度估计中结论了一些广泛使用的数据集和评估指标。此外,我们根据不同的培训方式回顾了一些代表性的现有方法:受监督,无监督和半监督。最后,我们讨论了挑战,并为将来的单眼深度估计提供了一些想法。
Depth information is important for autonomous systems to perceive environments and estimate their own state. Traditional depth estimation methods, like structure from motion and stereo vision matching, are built on feature correspondences of multiple viewpoints. Meanwhile, the predicted depth maps are sparse. Inferring depth information from a single image (monocular depth estimation) is an ill-posed problem. With the rapid development of deep neural networks, monocular depth estimation based on deep learning has been widely studied recently and achieved promising performance in accuracy. Meanwhile, dense depth maps are estimated from single images by deep neural networks in an end-to-end manner. In order to improve the accuracy of depth estimation, different kinds of network frameworks, loss functions and training strategies are proposed subsequently. Therefore, we survey the current monocular depth estimation methods based on deep learning in this review. Initially, we conclude several widely used datasets and evaluation indicators in deep learning-based depth estimation. Furthermore, we review some representative existing methods according to different training manners: supervised, unsupervised and semi-supervised. Finally, we discuss the challenges and provide some ideas for future researches in monocular depth estimation.