论文标题
有效检查按照图编码的事件日志对定时订单合规性规则
Efficient Checking of Timed Order Compliance Rules over Graph-encoded Event Logs
论文作者
论文摘要
验证对流程数据的合规性规则是业务流程管理的基本功能。多年来,对于不同类型的过程数据,即过程模型,运行时的过程事件数据以及代表历史执行的事件日志已经解决了问题。已经提出了几种方法来解决对过程日志的合规性检查。这些方法基于不同的数据模型和存储技术,包括关系数据库,图形数据库和专有格式。基于图的事件日志编码是一个有希望的方向,它将几个过程分析任务转换为基础图上的查询。合规性检查是一类此类分析任务。在本文中,我们认为将日志数据编码为图是不足以保证对该数据查询的有效处理。由于合规性检查的互动性质,效率很重要。因此,合规性检查将受益于数据的子线性扫描。此外,随着添加更多数据的添加,例如,新批次到达,数据大小应逐渐增长,以优化存储空间和查询时间。我们使用图形表示形式提出了两种编码方法,并在neo4j中实现,并在一系列特殊的查询中显示了这些编码的好处,即定时订单合规规则。与基线编码相比,我们的实验在查询时间内显示高达5倍的速度以及图形尺寸的3倍。
Validation of compliance rules against process data is a fundamental functionality for business process management. Over the years, the problem has been addressed for different types of process data, i.e., process models, process event data at runtime, and event logs representing historical execution. Several approaches have been proposed to tackle compliance checking over process logs. These approaches have been based on different data models and storage technologies including relational databases, graph databases, and proprietary formats. Graph-based encoding of event logs is a promising direction that turns several process analytics tasks into queries on the underlying graph. Compliance checking is one class of such analysis tasks. In this paper, we argue that encoding log data as graphs alone is not enough to guarantee efficient processing of queries on this data. Efficiency is important due to the interactive nature of compliance checking. Thus, compliance checking would benefit from sub-linear scanning of the data. Moreover, as more data are added, e.g., new batches of logs arrive, the data size should grow sub-linearly to optimize both the space of storage and time for querying. We propose two encoding methods using graph representation, realized in Neo4J, and show the benefits of these encoding on a special class of queries, namely timed order compliance rules. Compared to a baseline encoding, our experiments show up to 5x speed up in the querying time as well as a 3x reduction in the graph size.