Multi-Task Learning Of Depth From Tele And Wide Stereo Image Pairs
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP)(2019)
摘要
In this paper, we introduce the problem of estimating the real world depth of elements in a scene captured by two cameras with different field of views, where the first field of view (FOV) is a Wide FOV obtained by a lens with 1x the optical zoom, and the second FOV is contained in the first FOV and corresponds to a tele zoom lens with 2x the optical zoom. Traditional stereo matching techniques can estimate the stereo disparity, and hence the depth, in the overlapping FOV between both cameras only, which corresponds to the Tele FOV. We refer to the problem of estimating the disparity, or inverse depth, for the union of FOVs as 'Tele-Wide disparity estimation'. We propose different deep learning solutions to establish baseline performances. We trained a single-image inverse-depth estimation (SIDE) network to estimate the inverse depth from the image corresponding to the Wide FOV only. We also trained a stereo image disparity estimation network to estimate the disparity for the overlapping Tele FOV only, and another tele-wide stereo matching network (TW-SMNet) for estimating the disparity for the union Wide FOV. We further propose an end-to-end multi-task tele-wide stereo matching deep neural network (MT-TW-SMNet) which attempts to do stereo matching in the overlapped Tele FOV and SIDE in the union Wide FOV. Experimental results on KITTI and the SceneFlow datasets establish baseline performances for the tele-wide stereo matching and demonstrate that multitask tele-wide stereo matching provides a reasonable solution to the Tele-Wide depth estimation problem.
更多查看译文
关键词
Tele-wide stereo, disparity estimation, stereo vision, inverse depth, multi-task learning, field of view
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络