Paper Title
All at Once: Temporally Adaptive Multi-Frame Interpolation with Advanced Motion Modeling
Authors
Abstract
Recent advances in high-refresh-rate displays, together with growing interest in high-rate slow motion and frame up-conversion, fuel the demand for efficient and cost-effective multi-frame video interpolation solutions. In that regard, inserting multiple frames between consecutive video frames is of paramount importance for the consumer electronics industry. State-of-the-art methods are iterative solutions that interpolate one frame at a time, which introduces temporal inconsistencies and clearly noticeable visual artifacts. Departing from the state of the art, this work introduces a true multi-frame interpolator. It uses a pyramidal-style network in the temporal domain to complete the multi-frame interpolation task in one shot. A novel flow estimation procedure using a relaxed loss function and an advanced cubic-based motion model are also used to further boost interpolation accuracy when complex motion segments are encountered. Results on the Adobe240 dataset show that the proposed method generates visually pleasing, temporally consistent frames and outperforms the current best off-the-shelf method by 1.57 dB in PSNR, with a model that is 8 times smaller and 7.7 times faster. The proposed method can be easily extended to interpolate a large number of new frames while remaining efficient because of its one-shot mechanism.
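To put the reported 1.57 dB PSNR gain in context, the following is a minimal sketch of the standard PSNR definition, 10 · log10(MAX² / MSE). It is not code from the paper: the function names `psnr` and `mse_ratio_for_gain`, the flat pixel-list input, and the 8-bit peak value of 255 are all assumptions made for illustration, and the paper's exact evaluation protocol is not described in the abstract.

```python
import math


def psnr(reference, estimate, max_value=255.0):
    """PSNR in dB between two equally sized flat pixel sequences.

    Assumes 8-bit intensities by default (max_value=255); this is an
    illustrative choice, not a detail taken from the paper.
    """
    if len(reference) != len(estimate) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixels.
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals
    return 10.0 * math.log10(max_value ** 2 / mse)


def mse_ratio_for_gain(gain_db):
    """Factor by which MSE shrinks for a PSNR gain of gain_db decibels."""
    return 10.0 ** (gain_db / 10.0)


if __name__ == "__main__":
    ratio = mse_ratio_for_gain(1.57)
    print(f"MSE ratio for a 1.57 dB gain: {ratio:.3f}")
    print(f"Relative MSE reduction: {1.0 - 1.0 / ratio:.1%}")
```

Because PSNR is logarithmic in the mean squared error, a 1.57 dB gain at a fixed peak value corresponds to roughly 30% lower mean squared error. That holds exactly for a single pair of frames; a dataset-level figure that averages per-frame PSNR values only approximates it.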