论文标题
实时驱动程序监视系统通过模式和查看分析
Real-Time Driver Monitoring Systems through Modality and View Analysis
论文作者
论文摘要
已知驾驶员分心是道路事故的主要原因。虽然监视系统可以检测与非驾驶相关的活动并促进降低风险,但它们必须准确有效才能适用。不幸的是,最先进的方法优先考虑准确性,同时忽略延迟,因为它们利用了连续帧高度相似的跨视图和多模式视频。因此,在本文中,我们通过忽略视频框架之间的时间关系并研究每种传感方式在检测驱动器活动中的重要性,从而追求时间效率的检测模型。实验表明,1)我们提出的算法是实时的,并且与基于视频的模型相比,计算显着降低了,可以实现相似的性能(97.5 \%AUC-PR); 2)带有红外渠道的最高视图比任何其他单一模式都更有信息。此外,我们通过手动注释其测试集以实现多分类,从而增强了爸爸数据集。我们还彻底分析了视觉传感器类型的影响及其对每个类别预测的位置。代码和新标签将发布。
Driver distractions are known to be the dominant cause of road accidents. While monitoring systems can detect non-driving-related activities and facilitate reducing the risks, they must be accurate and efficient to be applicable. Unfortunately, state-of-the-art methods prioritize accuracy while ignoring latency because they leverage cross-view and multimodal videos in which consecutive frames are highly similar. Thus, in this paper, we pursue time-effective detection models by neglecting the temporal relation between video frames and investigate the importance of each sensing modality in detecting drives' activities. Experiments demonstrate that 1) our proposed algorithms are real-time and can achieve similar performances (97.5\% AUC-PR) with significantly reduced computation compared with video-based models; 2) the top view with the infrared channel is more informative than any other single modality. Furthermore, we enhance the DAD dataset by manually annotating its test set to enable multiclassification. We also thoroughly analyze the influence of visual sensor types and their placements on the prediction of each class. The code and the new labels will be released.