基于自动分类的非结构化日志中基于自我分类的异常检测

论文标题

基于自动分类的非结构化日志中基于自我分类的异常检测

Self-Attentive Classification-Based Anomaly Detection in Unstructured Logs

论文作者

Nedelkoski, Sasho, Bogatinovski, Jasmin, Acker, Alexander, Cardoso, Jorge, Kao, Odej

论文摘要

检测异常是计算机系统安全性和可靠性的必不可少的挖掘任务。日志是几乎每个计算机系统中异常检测方法的常见和主要数据源。他们收集了描述运行时系统状态的一系列重要事件。最近的研究主要集中在一级深度学习方法上，这些方法是针对预定义的非可行数值日志表示的。主要限制是，这些模型无法学习描述正常和异常日志之间语义差异的日志表示，从而导致对未见日志的概括不佳。我们提出了Logsy，这是一种基于分类的方法，可以通过一种将正常数据与辅助日志数据集区分出正常数据的方式来学习日志表示，从而可以通过Internet易于访问。这种异常检测方法背后的想法是，辅助数据集足以增强正常数据的表示形式，但可以进行多样化，以适应过度拟合和改善概括。我们提出了一个基于注意力的编码模型，具有新的超球损失函数。这使学习紧凑的日志表示可以捕获正常和异常日志之间的内在差异。从经验上讲，与先前的方法相比，我们在F1分数中的平均提高为0.25。为了研究logsy的属性，我们执行了其他实验，包括评估辅助数据大小的影响，专家知识的影响以及学习的日志表示的质量。结果表明，学习的表示形式提高了先前方法的性能，例如PCA，相对改善为28.2％。

The detection of anomalies is essential mining task for the security and reliability in computer systems. Logs are a common and major data source for anomaly detection methods in almost every computer system. They collect a range of significant events describing the runtime system status. Recent studies have focused predominantly on one-class deep learning methods on predefined non-learnable numerical log representations. The main limitation is that these models are not able to learn log representations describing the semantic differences between normal and anomaly logs, leading to a poor generalization of unseen logs. We propose Logsy, a classification-based method to learn log representations in a way to distinguish between normal data from the system of interest and anomaly samples from auxiliary log datasets, easily accessible via the internet. The idea behind such an approach to anomaly detection is that the auxiliary dataset is sufficiently informative to enhance the representation of the normal data, yet diverse to regularize against overfitting and improve generalization. We propose an attention-based encoder model with a new hyperspherical loss function. This enables learning compact log representations capturing the intrinsic differences between normal and anomaly logs. Empirically, we show an average improvement of 0.25 in the F1 score, compared to the previous methods. To investigate the properties of Logsy, we perform additional experiments including evaluation of the effect of the auxiliary data size, the influence of expert knowledge, and the quality of the learned log representations. The results show that the learned representation boost the performance of the previous methods such as PCA with a relative improvement of 28.2%.

下载PDF全文

下载文献需遵守相关版权规定

论文标题