Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular Depth Estimation

Talker, Lior; Cohen, Aviad; Yosef, Erez; Dana, Alexandra; Dinerstein, Michael

arXiv:2212.05315 (cs)

[Submitted on 10 Dec 2022 (v1), last revised 3 Apr 2024 (this version, v3)]

Abstract: Monocular Depth Estimation (MDE) is a fundamental problem in computer vision with numerous applications. Recently, LIDAR-supervised methods have achieved remarkable per-pixel depth accuracy in outdoor scenes. However, significant errors are typically found in the proximity of depth discontinuities, i.e., depth edges, which often hinder the performance of depth-dependent applications that are sensitive to such inaccuracies, e.g., novel view synthesis and augmented reality. Since direct supervision for the location of depth edges is typically unavailable in sparse LIDAR-based scenes, encouraging the MDE model to produce correct depth edges is not straightforward. To the best of our knowledge, this paper is the first attempt to address the depth-edge issue in LIDAR-supervised scenes. In this work, we propose to learn to detect the location of depth edges from densely-supervised synthetic data, and to use the detected edges to generate supervision for the depth edges in MDE training. To evaluate our approach quantitatively despite the lack of depth-edge ground truth (GT) in LIDAR-based scenes, we manually annotated subsets of the KITTI and DDAD datasets with depth-edge ground truth. We demonstrate significant gains in the accuracy of the depth edges with comparable per-pixel depth accuracy on several challenging datasets. Code and datasets are available at this https URL.
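
As a rough illustration of why dense synthetic depth can supply depth-edge labels where sparse LIDAR cannot, the sketch below marks depth edges in a small dense depth map by thresholding the relative depth jump between neighbouring pixels. This is not the authors' method (the abstract describes learning to detect edges); the function name `depth_edges`, the 4-neighbour rule, and the `rel_threshold` value are all assumptions made for this example.

```python
def depth_edges(depth, rel_threshold=0.1):
    """Mark pixels that sit on a depth discontinuity.

    Hypothetical rule (an assumption, not taken from the paper): two
    neighbouring pixels form an edge when their depth difference exceeds
    rel_threshold times the smaller of the two depths.
    """
    h, w = len(depth), len(depth[0])
    edges = [[False] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Compare with the right and bottom neighbours only, so each
            # neighbouring pair is visited exactly once.
            for dy, dx in ((0, 1), (1, 0)):
                ny, nx = y + dy, x + dx
                if ny < h and nx < w:
                    a, b = depth[y][x], depth[ny][nx]
                    if abs(a - b) > rel_threshold * min(a, b):
                        edges[y][x] = True
                        edges[ny][nx] = True
    return edges


# Toy dense depth map in metres: a near object (5 m) in front of a far wall (20 m).
toy_depth = [
    [20.0, 20.0, 20.0, 20.0],
    [20.0, 5.0, 5.0, 20.0],
    [20.0, 5.0, 5.0, 20.0],
    [20.0, 20.0, 20.0, 20.0],
]
toy_edges = depth_edges(toy_depth)
```

With sparse LIDAR, most neighbouring pixel pairs have no depth value at all, so a rule like this cannot be applied directly; that gap is what motivates learning edge locations from dense synthetic data.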

Comments:	Appears in CVPR 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2212.05315 [cs.CV]
	(or arXiv:2212.05315v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2212.05315

Submission history

From: Lior Talker
[v1] Sat, 10 Dec 2022 14:49:24 UTC (20,748 KB)
[v2] Wed, 6 Sep 2023 06:58:29 UTC (29,664 KB)
[v3] Wed, 3 Apr 2024 11:03:52 UTC (35,152 KB)
