论文标题
探索实时视频显着对象检测的丰富有效的空间时间互动
Exploring Rich and Efficient Spatial Temporal Interactions for Real Time Video Salient Object Detection
论文作者
论文摘要
当前的主流方法主要来自两个独立的场所,即空间和时间分支。作为互补组件,时间分支的主要任务是间歇性地将空间分支集中在那些具有显着运动的区域上。通过这种方式,即使整体视频显着质量在很大程度上取决于其空间分支,但颞分支的性能仍然很重要。因此,提高整体视频显着性的关键因素是如何有效地进一步提高这些分支的性能。在本文中,我们提出了一个新颖的时空网络,以完全互动方式实现这种改进。我们将轻量级的时间模型集成到空间分支中,以使那些与值得信赖的显着运动相关的空间显着区域。同时,空间分支本身能够以多尺度的方式将时间模型重新完善。这样,空间和时间分支都能够相互相互作用,从而实现了相互的绩效提高。我们的方法易于实施但有效,可以实现50 fps的实时速度实现高质量的视频检测。
The current main stream methods formulate their video saliency mainly from two independent venues, i.e., the spatial and temporal branches. As a complementary component, the main task for the temporal branch is to intermittently focus the spatial branch on those regions with salient movements. In this way, even though the overall video saliency quality is heavily dependent on its spatial branch, however, the performance of the temporal branch still matter. Thus, the key factor to improve the overall video saliency is how to further boost the performance of these branches efficiently. In this paper, we propose a novel spatiotemporal network to achieve such improvement in a full interactive fashion. We integrate a lightweight temporal model into the spatial branch to coarsely locate those spatially salient regions which are correlated with trustworthy salient movements. Meanwhile, the spatial branch itself is able to recurrently refine the temporal model in a multi-scale manner. In this way, both the spatial and temporal branches are able to interact with each other, achieving the mutual performance improvement. Our method is easy to implement yet effective, achieving high quality video saliency detection in real-time speed with 50 FPS.