Deep Learning Depth Estimation Methods Overview by Categories

This is essentially a simplified version of "Monocular Depth Estimation Based on Deep Learning: An Overview" by Zhao et al., with some comments.

Supervised

Methods based on different architectures and loss functions

Eigen. Depth map prediction from a single image using a multi-scale deep network
Eigen. Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture
Mayer. A large dataset to train convolutional networks for disparity, optical flow, and scene flow estimation
Shelhamer. Scene intrinsics and depth from a single image
Laina. Deeper depth prediction with fully convolutional residual networks
- residual learning
- reverse Huber (berHu) loss, which gives better results than L2
Mancini. Fast robust monocular depth estimation for obstacle detection with fully convolutional networks
- uses both the image and optical flow as input to estimate depth
Chen. Single image depth prediction in the wild (2016, DIW)
- introduces the DIW dataset with relative depth annotations
Fu. Deep ordinal regression network for monocular depth estimation (DORN)
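
The reverse Huber (berHu) loss mentioned under Laina above can be sketched in a few lines. This is a minimal NumPy sketch, not the authors' implementation: it assumes the threshold `c` is set to one fifth of the largest absolute error in the batch, and `c_ratio` is an illustrative parameter name. Errors below `c` are penalised linearly (like L1), larger errors quadratically (like L2).

```python
import numpy as np

def berhu_loss(pred, target, c_ratio=0.2):
    """Reverse Huber (berHu) loss, averaged over all pixels.

    c_ratio is an assumed, illustrative parameter: the threshold c is
    c_ratio times the largest absolute error in the batch.
    """
    abs_err = np.abs(pred - target)
    c = c_ratio * abs_err.max()
    if c == 0.0:
        # Perfect prediction: avoid dividing by zero below.
        return 0.0
    # Quadratic branch for large errors; equals c at abs_err == c,
    # so the loss is continuous at the threshold.
    l2_branch = (abs_err ** 2 + c ** 2) / (2.0 * c)
    return float(np.where(abs_err <= c, abs_err, l2_branch).mean())

pred = np.zeros(4)
target = np.array([0.0, 0.1, 1.0, 0.5])
loss = berhu_loss(pred, target)
```

In the example, the two small errors fall in the linear branch and the two large ones in the quadratic branch.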

Methods based on conditional random fields

Li. Depth and surface normal estimation from monocular images using regression on deep features and hierarchical CRFs
- conditional random fields
- superpixels?
Liu. Learning depth from single monocular images using deep convolutional neural fields
Wang. Towards unified depth and semantic prediction from a single image
- exploits the consistency between depth and semantic labels
Zhang. Joint task-recursive learning for semantic segmentation and depth estimation
Xu. Structured attention guided convolutional neural fields for monocular depth estimation

Methods based on adversarial learning

Feng. SGANVO: Unsupervised deep visual odometry and depth estimation with stacked generative adversarial networks (IEEE, 2019)
Jung. Depth prediction from a single image with conditional adversarial networks (ICIP, 2017)
Gwn Lore. Generative adversarial networks for depth map estimation from RGB video (CVPR, 2018)

Unsupervised

Basic model

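Trained on monocular video sequences, these methods use the predicted depth and the relative camera pose to project one frame onto an adjacent frame; the photometric error between the projected and the real frame serves as the training signal. The camera intrinsics must be known.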

Zhou. Unsupervised learning of depth and ego-motion from video
Godard. Unsupervised monocular depth estimation with left-right consistency (CVPR, 2017)
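
The projection step used by methods trained on monocular sequences can be sketched as follows. This is a minimal NumPy sketch under stated assumptions, not code from any of the listed papers: it assumes a pinhole camera with a 3x3 intrinsics matrix `K`, a 4x4 rigid transform `T_target_to_source`, and strictly positive depth. The function name and argument names are illustrative. It returns, for every pixel of the target frame, the coordinates at which the source frame would be sampled.

```python
import numpy as np

def project_to_source(depth, K, T_target_to_source):
    """Map every target pixel to (u, v) coordinates in the source frame.

    depth: (H, W) predicted depth of the target frame (assumed > 0)
    K: (3, 3) pinhole intrinsics (assumed known)
    T_target_to_source: (4, 4) relative camera pose
    """
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    # Homogeneous pixel coordinates, shape (3, H*W).
    pix = np.stack([u, v, np.ones_like(u)], axis=0).reshape(3, -1).astype(np.float64)
    # Back-project pixels to 3D points in the target camera frame.
    cam = (np.linalg.inv(K) @ pix) * depth.reshape(1, -1)
    cam_h = np.vstack([cam, np.ones((1, cam.shape[1]))])
    # Move the points into the source camera frame and project them.
    src = (T_target_to_source @ cam_h)[:3]
    proj = K @ src
    uv = proj[:2] / proj[2:3]
    return uv.reshape(2, h, w)

K = np.array([[100.0, 0.0, 2.0],
              [0.0, 100.0, 1.5],
              [0.0, 0.0, 1.0]])
depth = np.full((3, 4), 2.0)
T = np.eye(4)
T[0, 3] = 0.1  # small translation along x
uv = project_to_source(depth, K, T)
```

Sampling the source image at `uv` (for example with bilinear interpolation) gives a synthesised target frame, and its difference from the real target frame is the photometric error. The dependence on `K` is why the intrinsics need to be known.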

Methods based on explainability mask

The models above assume that the scene is static. Because this assumption may not hold in the real world, explainability masks are proposed to restrict the loss to the static parts of the scene.
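
One common way to use such a mask is to weight the per-pixel photometric error by it and to add a regulariser that stops the network from predicting an all-zero mask. The following is a minimal NumPy sketch of that idea, not a specific paper's implementation: the function name, `reg_weight`, and `eps` are illustrative assumptions, and the regulariser is a cross-entropy against a constant label of 1.

```python
import numpy as np

def masked_photometric_loss(target, warped, mask, reg_weight=0.2, eps=1e-7):
    """Photometric loss weighted by a per-pixel explainability mask.

    mask: values in [0, 1]; low values mark pixels that the static-scene
    assumption is not expected to explain (e.g. moving objects).
    reg_weight and eps are assumed, illustrative hyperparameters.
    """
    photometric = mask * np.abs(target - warped)
    # Cross-entropy against a constant label of 1: penalises mask values
    # below 1, which discourages the trivial all-zero mask.
    regulariser = -np.log(np.clip(mask, eps, 1.0))
    return float(photometric.mean() + reg_weight * regulariser.mean())

target = np.zeros((2, 2))
warped = np.array([[0.0, 0.0], [0.0, 1.0]])  # one pixel violates the assumption
full_mask = np.ones((2, 2))
soft_mask = np.array([[1.0, 1.0], [1.0, 0.5]])

loss_full = masked_photometric_loss(target, warped, full_mask)
loss_soft = masked_photometric_loss(target, warped, soft_mask)
```

Down-weighting the violating pixel lowers the total loss only as long as the photometric saving outweighs the regularisation cost, which is the trade-off the mask has to learn.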

Methods based on traditional visual odometry

TBA

Methods based on a multi-task framework

TBA

Methods based on adversarial learning

TBA

Semi-supervised monocular depth estimation

Basic model

Trained on stereo image pairs, semi-supervised methods predict a disparity map and use it to inverse-warp one view into the other; the error between the warped and the real view supervises the network.
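
Inverse warping guided by disparity can be sketched as follows. This is a minimal NumPy sketch under stated assumptions: the pair is rectified, the images are single-channel, and the disparity is expressed in pixels for the left view, so a left pixel at column `x` corresponds to the right pixel at column `x - d`. The function name is illustrative, and sampling uses linear interpolation along each row.

```python
import numpy as np

def warp_right_to_left(right, disparity):
    """Reconstruct the left view by sampling the right view at x - d.

    right: (H, W) grayscale right image
    disparity: (H, W) left-view disparity in pixels (assumed rectified pair)
    """
    h, w = right.shape
    xs = np.arange(w, dtype=np.float64)
    out = np.empty((h, w), dtype=np.float64)
    for y in range(h):
        # Clamp sample positions to the image; border pixels are repeated.
        sample_x = np.clip(xs - disparity[y], 0, w - 1)
        out[y] = np.interp(sample_x, xs, right[y])
    return out

left = np.tile(np.arange(8, dtype=np.float64) * 10.0, (2, 1))
right = np.roll(left, -2, axis=1)  # synthetic right view with disparity 2
disparity = np.full((2, 8), 2.0)
recon = warp_right_to_left(right, disparity)
photometric_error = np.abs(recon[:, 2:] - left[:, 2:]).mean()
```

With the correct disparity the reconstruction matches the left image wherever the sample falls inside the right image, so the photometric error vanishes there. In training, the disparity comes from the network and this error is what gets minimised.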

Methods based on stereo matching

Luo. Single view stereo matching (CVPR, 2018)

Methods based on adversarial learning and knowledge distillation

Pilzer. Refine and distill: Exploiting cycle-inconsistency and knowledge distillation for unsupervised monocular depth estimation
