Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering

Fink, Laura; Franke, Linus; Egger, Bernhard; Keinert, Joachim; Stamminger, Marc

dc.contributor.author	Fink, Laura	en_US
dc.contributor.author	Franke, Linus	en_US
dc.contributor.author	Egger, Bernhard	en_US
dc.contributor.author	Keinert, Joachim	en_US
dc.contributor.author	Stamminger, Marc	en_US
dc.contributor.editor	Egger, Bernhard	en_US
dc.contributor.editor	Günther, Tobias	en_US
dc.date.accessioned	2025-09-24T10:37:18Z
dc.date.available	2025-09-24T10:37:18Z
dc.date.issued	2025
dc.description.abstract	Accurate depth estimation is at the core of many applications in computer graphics, vision, and robotics. Current state-of-the-art monocular depth estimators, trained on extensive datasets, generalize well but lack 3D consistency needed for many applications. In this paper, we combine the strength of those generalizing monocular depth estimation techniques with multi-view data by framing this as an analysis-by-synthesis optimization problem to lift and refine such relative depth maps to accurate error-free depth maps. After an initial global scale estimation through structure-from-motion point clouds, we further refine the depth map through optimization enforcing multi-view consistency via photometric and geometric losses with differentiable rendering of the meshed depth map. In a two-stage optimization, scaling is further refined first, and afterwards artifacts and errors in the depth map are corrected via nearby-view photometric supervision. Our evaluation shows that our method is able to generate detailed, high-quality, view-consistent, accurate depth maps, also in challenging indoor scenarios, and outperforms state-of-the-art multi-view depth reconstruction approaches on such datasets. Project page and source code can be found at https://lorafib.github.io/ref_depth/.	en_US
dc.description.sectionheaders	Neural and Differentiable Rendering
dc.description.seriesinformation	Vision, Modeling, and Visualization
dc.identifier.doi	10.2312/vmv.20251232
dc.identifier.isbn	978-3-03868-294-3
dc.identifier.pages	10 pages
dc.identifier.uri	https://doi.org/10.2312/vmv.20251232
dc.identifier.uri	https://diglib.eg.org/handle/10.2312/vmv20251232
dc.publisher	The Eurographics Association	en_US
dc.rights	Attribution 4.0 International License
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	CCS Concepts: Computing methodologies → Computer vision problems; Rasterization
dc.subject	Computing methodologies → Computer vision problems
dc.subject	Rasterization
dc.title	Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering	en_US
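
The abstract mentions an initial global scale estimation from structure-from-motion point clouds. As a rough illustration only, and not the authors' implementation, the sketch below fits a single scale factor that maps relative monocular depths onto sparse SfM depths sampled at the same pixels. It assumes the relative depth differs from metric depth by a scale only (no shift), and the function names `estimate_global_scale` and `estimate_global_scale_median` are hypothetical.

```python
from statistics import median


def _valid_pairs(relative_depths, sfm_depths):
    """Keep only correspondences where both depths are positive."""
    pairs = [(r, m) for r, m in zip(relative_depths, sfm_depths) if r > 0 and m > 0]
    if not pairs:
        raise ValueError("no valid depth correspondences")
    return pairs


def estimate_global_scale(relative_depths, sfm_depths):
    """Closed-form least-squares scale s minimizing sum((s * r - m)^2).

    Assumption: relative depth r and SfM depth m differ by a scale only.
    """
    pairs = _valid_pairs(relative_depths, sfm_depths)
    numerator = sum(r * m for r, m in pairs)
    denominator = sum(r * r for r, _ in pairs)
    return numerator / denominator


def estimate_global_scale_median(relative_depths, sfm_depths):
    """Median of per-point ratios m / r, which tolerates outlier SfM points."""
    pairs = _valid_pairs(relative_depths, sfm_depths)
    return median(m / r for r, m in pairs)


if __name__ == "__main__":
    # Hypothetical values: relative depths at pixels where SfM points project.
    relative = [1.0, 2.0, 4.0]
    sfm = [2.5, 5.0, 10.0]
    print(estimate_global_scale(relative, sfm))
```

The least-squares variant is sensitive to mismatched SfM points, so the median-of-ratios variant is included as a simple robust alternative. Either result would only serve as a starting point for the multi-view refinement the abstract describes.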