Varun Jampani*
Huiwen Chang*
Kyle Sargent
Abhishek Kar
Richard Tucker
Michael Krainin
Dominik Kaeser
William T. Freeman
David Salesin
Brian Curless
Ce Liu
International Conference on Computer Vision (ICCV 2021, Oral)
Single image 3D photography enables viewers to view a still image from novel viewpoints. Recent approaches combine monocular depth networks with inpainting networks to achieve compelling results. A drawback of these techniques is the use of hard depth layering, making them unable to model intricate appearance details such as thin hair-like structures. We present SLIDE, a modular and unified system for single image 3D photography that uses a simple yet effective soft layering strategy to better preserve appearance details in novel views. In addition, we propose a novel depth-aware training strategy for our inpainting module, better suited for the 3D photography task. The resulting SLIDE approach is modular, enabling the use of other components such as segmentation and matting for improved layering. At the same time, SLIDE uses an efficient layered depth formulation that only requires a single forward pass through the component networks to produce high quality 3D photos. Extensive experimental analysis on three view-synthesis datasets, in combination with user studies on in-the-wild image collections, demonstrate superior performance of our technique in comparison to existing strong baselines while being conceptually much simpler.
@inproceedings{jampani:ICCV:2021,
title = {SLIDE: Single Image 3D Photography with Soft Layering and Depth-aware Inpainting},
author = {Jampani, Varun and Chang, Huiwen and Sargent, Kyle and Kar, Abhishek and Tucker, Richard and Krainin, Michael and Kaeser, Dominik and Freeman, William T and Salesin, David and Curless, Brian and Liu, Ce},
booktitle={Proceedings of the IEEE International Conference on Computer Vision},
year={2021}
}