Title: Foreground-aware Dense Depth Estimation for 360 Images
Authors: Feng, Qi
Shum, Hubert P. H.
Shimamura, Ryo
Morishima, Shigeo
Citation: Journal of WSCG. 2020, vol. 28, no. 1-2, p. 79-88.
Issue Date: 2020
Publisher: Václav Skala - UNION Agency
Document type: article
URI: http://wscg.zcu.cz/WSCG2020/2020-J_WSCG-1-2.pdf
ISSN: 1213-6972 (print)
1213-6980 (CD-ROM)
1213-6964 (on-line)
Keywords: odhad hloubky;porozumění scéně;rozšiřování dat;360 obrázků
Keywords in different language: depth estimation;scene understanding;data augmentation;360 images
Abstract in different language: With 360 imaging devices becoming widely accessible, omnidirectional content has gained popularity in multiple fields. The ability to estimate depth from a single omnidirectional image can benefit applications such as robotics navigation and virtual reality. However, existing depth estimation approaches produce sub-optimal results on real-world omnidirectional images with dynamic foreground objects. On the one hand, capture-based methods cannot obtain the foreground due to the limitations of the scanning and stitching schemes. On the other hand, it is challenging for synthesis-based methods to generate highly-realistic virtual foreground objects that are comparable to the real-world ones. In this paper, we propose to augment datasets with realistic foreground objects using an image-based approach, which produces a foreground-aware photorealistic dataset for machine learning algorithms. By exploiting a novel scale-invariant RGB-D orrespondence in the spherical domain, we repurpose abundant non-omnidirectional datasets to include realistic foreground objects with correct distortions. We further propose a novel auxiliary deep neural network to estimate both the depth of the omnidirectional images and the mask of the foreground objects, where the two tasks facilitate each other. A new local depth loss considers small regions of interests and ensures that their depth estimations are not smoothed out during the global gradient’s optimization. We demonstrate the system using human as the foreground due to its complexity and contextual importance, while the framework can be generalized to any other foreground objects. Experimental results demonstrate more consistent global estimations and more accurate local estimations compared with state-of-the-arts.
Rights: © Václav Skala - UNION Agency
Appears in Collections:Volume 28, Number 1-2 (2020)

Files in This Item:
File Description SizeFormat 
Feng.pdfPlný text10,45 MBAdobe PDFView/Open

Please use this identifier to cite or link to this item: http://hdl.handle.net/11025/38428

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.