Exploring Spatial-Temporal Multi-Frequency Analysis For High-Fidelity And Temporal-Consistency Video Prediction

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)(2020)

引用 109|浏览447
暂无评分
摘要
Video prediction is a pixel-wise dense prediction task to infer future frames based on past frames. Missing appearance details and motion blur are still two major problems for current models, leading to image distortion and temporal inconsistency. We point out the necessity of exploring multi frequency analysis to deal with the two problems. Inspired by the frequency band decomposition characteristic of Human Vision System (HVS), we propose a video prediction network based on multi-level wavelet analysis to uniformly deal with spatial and temporal information. Specifically, multi-level spatial discrete wavelet transform decomposes each video frame into anisotropic sub-bands with multiple frequencies, helping to enrich structural information and reserve fine details. On the other hand, multi-level temporal discrete wavelet transform which operates on time axis decomposes the frame sequence into sub-band groups of different frequencies to accurately capture multi-frequency motions under a fixed frame rate. Extensive experiments on diverse datasets demonstrate that our model shows significant improvements on fidelity and temporal consistency over the state-of-the-art works. Source code and videos are available at https://gitub.com/Bei-Jin/STMFANet.
更多
查看译文
关键词
temporal inconsistency,frequency band decomposition,Human Vision System,video prediction network,multilevel wavelet analysis,spatial information,temporal information,multilevel spatial discrete,video frame,anisotropic sub-bands,structural information,reserve fine details,multilevel temporal discrete wavelet,frame sequence,sub-band groups,fixed frame rate,temporal consistency,spatial-temporal multifrequency analysis,temporal-consistency video prediction,pixel-wise dense prediction task,appearance details,motion blur,image distortion
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要