DDD: Deep indoor panoramic Depth estimation with Density maps consistency

Pintore, Giovanni; Agus, Marco; Signoroni, Alberto; Gobbetti, Enrico

DDD: Deep indoor panoramic Depth estimation with Density maps consistency

dc.contributor.author	Pintore, Giovanni	en_US
dc.contributor.author	Agus, Marco	en_US
dc.contributor.author	Signoroni, Alberto	en_US
dc.contributor.author	Gobbetti, Enrico	en_US
dc.contributor.editor	Caputo, Ariel	en_US
dc.contributor.editor	Garro, Valeria	en_US
dc.contributor.editor	Giachetti, Andrea	en_US
dc.contributor.editor	Castellani, Umberto	en_US
dc.contributor.editor	Dulecha, Tinsae Gebrechristos	en_US
dc.date.accessioned	2024-11-11T12:48:07Z
dc.date.available	2024-11-11T12:48:07Z
dc.date.issued	2024
dc.description.abstract	We introduce a novel deep neural network for rapid and structurally consistent monocular 360◦ depth estimation in indoor environments. The network infers a depth map from a single gravity-aligned or gravity-rectified equirectangular image of the environment, ensuring that the predicted depth aligns with the typical depth distribution and features of cluttered interior spaces, which are usually enclosed by walls, ceilings, and floors. By leveraging the distinct characteristics of vertical and horizontal features in man-made indoor environments, we introduce a lean network architecture that employs gravity-aligned feature flattening and specialized vision transformers that utilize the input's omnidirectional nature, without segmentation into patches and positional encoding. To enhance the structural consistency of the predicted depth, we introduce a new loss function that evaluates the consistency of density maps by projecting points derived from the inferred depth map onto horizontal and vertical planes. This lightweight architecture has very small computational demands, provides greater structural consistency than competing methods, and does not require the explicit imposition of strong structural priors.	en_US
dc.description.sectionheaders	Computer Vision
dc.description.seriesinformation	Smart Tools and Applications in Graphics - Eurographics Italian Chapter Conference
dc.identifier.doi	10.2312/stag.20241336
dc.identifier.isbn	978-3-03868-265-3
dc.identifier.issn	2617-4855
dc.identifier.pages	10 pages
dc.identifier.uri	https://doi.org/10.2312/stag.20241336
dc.identifier.uri	https://diglib.eg.org/handle/10.2312/stag20241336
dc.publisher	The Eurographics Association	en_US
dc.rights	Attribution 4.0 International License
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	CCS Concepts: Computing methodologies → Computer vision; Shape inference; Neural networks
dc.subject	Computing methodologies → Computer vision
dc.subject	Shape inference
dc.subject	Neural networks
dc.title	DDD: Deep indoor panoramic Depth estimation with Density maps consistency	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: stag20241336.pdf
Size:: 5.74 MB
Format:: Adobe Portable Document Format

Download

Collections

Italian Chapter Conference 2024 - Smart Tools and Apps in Graphics