sumgraph：通过递归图建模的视频摘要

论文标题

sumgraph：通过递归图建模的视频摘要

SumGraph: Video Summarization via Recursive Graph Modeling

论文作者

Park, Jungin, Lee, Jiyoung, Kim, Ig-Jae, Sohn, Kwanghoon

论文摘要

视频摘要的目的是选择视觉上不同的关键帧，并且可以代表输入视频的整个故事。视频摘要的最先进方法主要通过汇总所有重量的所有帧来将任务视为框架的关键帧选择问题。但是，要查找视频中有益的部分，有必要考虑视频的所有帧如何相互关联。为此，我们将视频摘要作为图形建模问题。我们提出了用于视频摘要的递归图形建模网络，称为sumgraph，以表示关系图，其中帧被视为节点，节点是通过帧之间的语义关系连接的。我们的网络通过一种递归方法来完成此操作，以优化最初估计的图表，以通过图形卷积网络对图表表示，将每个节点正确分类为键框。为了在更实用的环境中利用Sumgraph，我们还提供了一种以无监督的方式调整我们的图形建模的方法。在Sumgraph的情况下，我们在几个基准测试基准上实现了最先进的表现，以进行视频摘要，以监督和无监督的举止。

The goal of video summarization is to select keyframes that are visually diverse and can represent a whole story of an input video. State-of-the-art approaches for video summarization have mostly regarded the task as a frame-wise keyframe selection problem by aggregating all frames with equal weight. However, to find informative parts of the video, it is necessary to consider how all the frames of the video are related to each other. To this end, we cast video summarization as a graph modeling problem. We propose recursive graph modeling networks for video summarization, termed SumGraph, to represent a relation graph, where frames are regarded as nodes and nodes are connected by semantic relationships among frames. Our networks accomplish this through a recursive approach to refine an initially estimated graph to correctly classify each node as a keyframe by reasoning the graph representation via graph convolutional networks. To leverage SumGraph in a more practical environment, we also present a way to adapt our graph modeling in an unsupervised fashion. With SumGraph, we achieved state-of-the-art performance on several benchmarks for video summarization in both supervised and unsupervised manners.

下载PDF全文

下载文献需遵守相关版权规定

论文标题