论文标题
迈向分布之外的顺序事件预测:因果治疗
Towards Out-of-Distribution Sequential Event Prediction: A Causal Treatment
论文作者
论文摘要
顺序事件预测的目的是根据一系列历史事件估算下一个事件,并应用了顺序建议,用户行为分析和临床治疗。实际上,下一个事实预测模型经过一次收集的顺序数据培训,需要在远程将来推广到新到达的序列,这需要模型来处理从训练到测试的时间分布转移。在本文中,我们首先采用一个生成数据的观点,以揭示一个负面结果,即由于潜在上下文混杂因素而导致的最大似然估计的现有方法将失败,即历史事件和下一个事件的共同原因。然后,我们基于后门调整和进一步利用变异推理来设计一个新的学习目标,以使其可用于序列学习问题。最重要的是,我们提出了一个具有分层分支结构的框架,用于学习上下文特定的表示。关于不同任务的全面实验(例如,顺序建议)证明了我们方法的有效性,适用性和可扩展性,以各种现成模型为骨架。
The goal of sequential event prediction is to estimate the next event based on a sequence of historical events, with applications to sequential recommendation, user behavior analysis and clinical treatment. In practice, the next-event prediction models are trained with sequential data collected at one time and need to generalize to newly arrived sequences in remote future, which requires models to handle temporal distribution shift from training to testing. In this paper, we first take a data-generating perspective to reveal a negative result that existing approaches with maximum likelihood estimation would fail for distribution shift due to the latent context confounder, i.e., the common cause for the historical events and the next event. Then we devise a new learning objective based on backdoor adjustment and further harness variational inference to make it tractable for sequence learning problems. On top of that, we propose a framework with hierarchical branching structures for learning context-specific representations. Comprehensive experiments on diverse tasks (e.g., sequential recommendation) demonstrate the effectiveness, applicability and scalability of our method with various off-the-shelf models as backbones.