论文标题
基于多元运动特征的整合行为识别
Behavior Recognition Based on the Integration of Multigranular Motion Features
论文作者
论文摘要
视频中对行为的识别通常需要对有关对象及其动态动作信息的空间信息进行组合分析。具体而言,行为识别甚至可能更多地依赖于包含短程和远程运动的时间信息的建模。这与涉及侧重于理解空间信息的图像的计算机视觉任务形成对比。但是,当前的解决方案无法共同和全面地分析视频中大规模的相邻帧和远程时间聚集之间的短距离运动。在本文中,我们提出了一种基于多氏(IMG)运动特征的整合的新型行为识别方法。特别是,我们通过基于通道注意的短期运动功能增强模块(CMEM)和级联的长期运动特征集成模块(CLIM)的协同作用实现可靠的运动信息建模。我们对几个动作识别基准(例如HMDB51,Soseings sosemething and ucf101)进行了评估。实验结果表明,我们的方法的表现优于先前的最新方法,这证实了其有效性和效率。
The recognition of behaviors in videos usually requires a combinatorial analysis of the spatial information about objects and their dynamic action information in the temporal dimension. Specifically, behavior recognition may even rely more on the modeling of temporal information containing short-range and long-range motions; this contrasts with computer vision tasks involving images that focus on the understanding of spatial information. However, current solutions fail to jointly and comprehensively analyze short-range motion between adjacent frames and long-range temporal aggregations at large scales in videos. In this paper, we propose a novel behavior recognition method based on the integration of multigranular (IMG) motion features. In particular, we achieve reliable motion information modeling through the synergy of a channel attention-based short-term motion feature enhancement module (CMEM) and a cascaded long-term motion feature integration module (CLIM). We evaluate our model on several action recognition benchmarks such as HMDB51, Something-Something and UCF101. The experimental results demonstrate that our approach outperforms the previous state-of-the-art methods, which confirms its effectiveness and efficiency.