论文标题
一阶段检测变压器
One-stage Action Detection Transformer
论文作者
论文摘要
在这项工作中,我们将解决方案介绍给Epic-Kitchens-100 2022动作检测挑战。提出了一阶段动作检测变压器(OADT)来对视频段的时间连接进行建模。借助OADT,可以同时识别类别和时间边界。在结合了从不同功能训练的多个OADT模型之后,我们的模型可以达到21.28 \%的动作图,并在动作检测挑战的测试集中排名第一。
In this work, we introduce our solution to the EPIC-KITCHENS-100 2022 Action Detection challenge. One-stage Action Detection Transformer (OADT) is proposed to model the temporal connection of video segments. With the help of OADT, both the category and time boundary can be recognized simultaneously. After ensembling multiple OADT models trained from different features, our model can reach 21.28\% action mAP and ranks the 1st on the test-set of the Action detection challenge.