AANet: Adaptive Aggregation Network for Efficient Stereo Matching

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020

Abstract
Despite the remarkable progress made by learning-based stereo matching algorithms, one key challenge remains unsolved. Current state-of-the-art stereo models are mostly based on costly 3D convolutions, whose cubic computational complexity and high memory consumption make them quite expensive to deploy in real-world applications. In this paper, we aim at completely replacing the commonly used 3D convolutions to achieve fast inference speed while maintaining comparable accuracy. To this end, we first propose a sparse points based intra-scale cost aggregation method to alleviate the well-known edge-fattening issue at disparity discontinuities. Further, we approximate the traditional cross-scale cost aggregation algorithm with neural network layers to handle large textureless regions. Both modules are simple, lightweight, and complementary, leading to an effective and efficient architecture for cost aggregation. With these two modules, we can not only significantly speed up existing top-performing models (e.g., 41× faster than GC-Net, 4× faster than PSMNet, and 38× faster than GA-Net), but also improve the performance of fast stereo models (e.g., StereoNet). We also achieve competitive results on the Scene Flow and KITTI datasets while running at 62 ms, demonstrating the versatility and high efficiency of the proposed method. Our full framework is available at https://github.com/haofeixu/aanet.
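To make the cost argument concrete, the following minimal sketch counts multiply-accumulate operations for one dense 3D convolution layer versus one dense 2D convolution layer over a cost volume. All sizes (`H`, `W`, `D`, `C`, `K`) and the helper `conv_macs` are hypothetical and chosen only for illustration; they are not taken from the paper. Treating disparity as the channel dimension in the 2D case is likewise an assumption of this sketch, not a description of the authors' architecture.

```python
def conv_macs(kernel_size, spatial_dims, in_channels, out_channels, output_positions):
    """Multiply-accumulates for one dense convolution layer.

    kernel_size ** spatial_dims is the number of taps per kernel:
    K*K*K for a 3D convolution, K*K for a 2D convolution.
    """
    return (kernel_size ** spatial_dims) * in_channels * out_channels * output_positions


# Hypothetical sizes (assumptions for illustration only).
H, W = 64, 128   # spatial resolution of the cost volume
D = 48           # number of disparity candidates
C = 32           # feature channels in the 4D cost volume
K = 3            # kernel size

# 3D convolution over a C x D x H x W volume, mapping C -> C channels.
macs_3d = conv_macs(K, 3, C, C, D * H * W)

# 2D convolution over a D x H x W volume, with disparity treated as channels.
macs_2d = conv_macs(K, 2, D, D, H * W)

ratio = macs_3d / macs_2d
print(f"3D: {macs_3d:,} MACs, 2D: {macs_2d:,} MACs, ratio: {ratio:.1f}x")
```

Under these assumed sizes, the ratio reduces to `K * C**2 / D`, so the gap grows with both the kernel size and the channel width of the 4D volume. This is a per-layer operation count only and says nothing about measured runtime.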
Keywords
stereo matching algorithms, 3D convolutions, cubic computational complexity, high memory consumption, edge-fattening issue, disparity discontinuities, neural network layers, adaptive aggregation network, intra-scale cost aggregation method, cross-scale cost aggregation algorithm, KITTI datasets, PSMNet, time 62.0 ms