LDM3D-VR: Latent Diffusion Model for 3D VR

Gabriela Ben Melech Stan, Diana Wofk, Estelle Aflalo, Shao-Yen Tseng, Zhipeng Cai, Michael Paulitsch, Vasudev Lal

CoRR (2023)

Abstract
Latent diffusion models have proven to be state-of-the-art in the creation and manipulation of visual outputs. However, as far as we know, the generation of depth maps jointly with RGB is still limited. We introduce LDM3D-VR, a suite of diffusion models targeting virtual reality development that includes LDM3D-pano and LDM3D-SR. These models enable the generation of panoramic RGBD based on textual prompts and the upscaling of low-resolution inputs to high-resolution RGBD, respectively. Our models are fine-tuned from existing pretrained models on datasets containing panoramic/high-resolution RGB images, depth maps and captions. Both models are evaluated in comparison to existing related methods.
Keywords
latent diffusion model, diffusion model