论文标题
跨域局部特征增强的深层视频检测
Cross-Domain Local Characteristic Enhanced Deepfake Video Detection
论文作者
论文摘要
随着超现实的面对伪造技术的出现,由于安全关注,深击检测引起了越来越多的关注。尽管在已知的伪造方面表现出色,许多探测器仍无法检测到看不见的操作时无法获得准确的结果。在本文中,我们的动机是:真实视频和假视频之间的差异非常微妙和本地化,并且在各个信息域的某些关键面部地区可能存在不一致或不规则性。为此,我们提出了一条新型的管道,跨域的局部取证(XDLF),以进行更一般的深层视频检测。在拟议的管道中,提出了一个专门的框架,以同时利用空间,频率和时间域的局部伪造模式,从而学习跨域特征以检测伪造。此外,该框架利用了人脸的四个高级伪造的地方区域,以指导该模型增强微妙的文物并定位潜在的异常情况。在几个基准数据集上进行了广泛的实验证明了我们方法的令人印象深刻的性能,并且我们在跨数据集概括的几种最新方法上实现了优越性。我们还研究了通过消融促成其性能的因素,这表明利用跨域局部特征是开发更一般的深层探测器的一个值得注意的方向。
As ultra-realistic face forgery techniques emerge, deepfake detection has attracted increasing attention due to security concerns. Many detectors cannot achieve accurate results when detecting unseen manipulations despite excellent performance on known forgeries. In this paper, we are motivated by the observation that the discrepancies between real and fake videos are extremely subtle and localized, and inconsistencies or irregularities can exist in some critical facial regions across various information domains. To this end, we propose a novel pipeline, Cross-Domain Local Forensics (XDLF), for more general deepfake video detection. In the proposed pipeline, a specialized framework is presented to simultaneously exploit local forgery patterns from space, frequency, and time domains, thus learning cross-domain features to detect forgeries. Moreover, the framework leverages four high-level forgery-sensitive local regions of a human face to guide the model to enhance subtle artifacts and localize potential anomalies. Extensive experiments on several benchmark datasets demonstrate the impressive performance of our method, and we achieve superiority over several state-of-the-art methods on cross-dataset generalization. We also examined the factors that contribute to its performance through ablations, which suggests that exploiting cross-domain local characteristics is a noteworthy direction for developing more general deepfake detectors.