论文标题
时间序列异常检测的无监督模型选择
Unsupervised Model Selection for Time-series Anomaly Detection
论文作者
论文摘要
时间序列中的异常检测具有广泛的实际应用。尽管文献中已经提出了许多异常检测方法,但最近的一项调查得出的结论是,在各种数据集中,没有一种方法是最准确的。更糟糕的是,在实践中很少有异常标签。在没有标签的情况下,为给定数据集选择最准确的模型的实际问题在文献中很少受到关注。本文回答了这个问题,即给出了一个未标记的数据集和一组候选异常检测器,我们如何选择最准确的模型?为此,我们确定了三类的替代(无监督)指标,即预测错误,模型中心性和注入合成异常的性能,并表明某些指标与标准监督的轴向检测性能指标高度相关,例如$ f_1 $得分,但对于各种程度而言。我们用多个不完美的替代指标来制定度量组合作为强大的等级聚集问题。然后,我们提供了拟议方法背后的理论理由。在多个现实世界数据集上进行的大规模实验表明,我们提出的无监督方法与基于部分标记的数据选择最准确的模型一样有效。
Anomaly detection in time-series has a wide range of practical applications. While numerous anomaly detection methods have been proposed in the literature, a recent survey concluded that no single method is the most accurate across various datasets. To make matters worse, anomaly labels are scarce and rarely available in practice. The practical problem of selecting the most accurate model for a given dataset without labels has received little attention in the literature. This paper answers this question i.e. Given an unlabeled dataset and a set of candidate anomaly detectors, how can we select the most accurate model? To this end, we identify three classes of surrogate (unsupervised) metrics, namely, prediction error, model centrality, and performance on injected synthetic anomalies, and show that some metrics are highly correlated with standard supervised anomaly detection performance metrics such as the $F_1$ score, but to varying degrees. We formulate metric combination with multiple imperfect surrogate metrics as a robust rank aggregation problem. We then provide theoretical justification behind the proposed approach. Large-scale experiments on multiple real-world datasets demonstrate that our proposed unsupervised approach is as effective as selecting the most accurate model based on partially labeled data.