Manhattan Room Layout Reconstruction from a Single $$360^{\circ }$$ 360 ∘ Image: A Comparative Study of State-of-the-Art Methods

INTERNATIONAL JOURNAL OF COMPUTER VISION(2021)

引用 47|浏览76
暂无评分
摘要
Recent approaches for predicting layouts from 360 $$^{\circ }$$ panoramas produce excellent results. These approaches build on a common framework consisting of three steps: a pre-processing step based on edge-based alignment, prediction of layout elements, and a post-processing step by fitting a 3D layout to the layout elements. Until now, it has been difficult to compare the methods due to multiple different design decisions, such as the encoding network (e.g., SegNet or ResNet), type of elements predicted (e.g., corners, wall/floor boundaries, or semantic segmentation), or method of fitting the 3D layout. To address this challenge, we summarize and describe the common framework, the variants, and the impact of the design decisions. For a complete evaluation, we also propose extended annotations for the Matterport3D dataset (Chang et al.: Matterport3d: learning from rgb-d data in indoor environments. arXiv:1709.06158 , 2017), and introduce two depth-based evaluation metrics.
更多
查看译文
关键词
3D room layout, Deep learning, Single image 3D, Manhattan world
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要