使用用户功能平衡的DEBIAS建议

论文标题

使用用户功能平衡的DEBIAS建议

Debiased Recommendation with User Feature Balancing

论文作者

Yang, Mengyue, Cai, Guohao, Liu, Furui, Dong, Zhenhua, He, Xiuqiang, Hao, Jianye, Wang, Jun, Chen, Xu

论文摘要

Debias的建议最近引起了行业和学术社区的越来越多的关注。传统模型主要依赖于反向倾向得分（IP），这可能很难估计，并且可能遭受高方差问题。为了减轻这些问题，在本文中，我们提出了一个基于用户功能平衡的新型辩护推荐框架。一般的想法是引入投影函数以调整用户特征分布，以便理想的无偏学习目标可以由纯粹基于离线数据集的可解决的目标界定上限。在上限中，预计给定不同项目的预计用户分布将相等。从因果推理的角度来看，该要求旨在将因果关系从用户到该项目中删除，这使我们能够实现公正的建议，绕过IPS的计算。为了有效地平衡用户分布在每个项目对上，我们提出了三种策略，包括剪裁，抽样和对抗性学习以改善培训过程。为了获得更强大的优化，我们部署了一个明确的模型，以捕获推荐系统中潜在的潜在混杂因素。据我们所知，本文是基于混淆者平衡的DEBIAS推荐的第一部作品。在实验中，我们将框架与基于合成，半合成和现实世界数据集的许多最新方法进行了比较。广泛的实验表明，我们的模型有效促进推荐性能。

Debiased recommendation has recently attracted increasing attention from both industry and academic communities. Traditional models mostly rely on the inverse propensity score (IPS), which can be hard to estimate and may suffer from the high variance issue. To alleviate these problems, in this paper, we propose a novel debiased recommendation framework based on user feature balancing. The general idea is to introduce a projection function to adjust user feature distributions, such that the ideal unbiased learning objective can be upper bounded by a solvable objective purely based on the offline dataset. In the upper bound, the projected user distributions are expected to be equal given different items. From the causal inference perspective, this requirement aims to remove the causal relation from the user to the item, which enables us to achieve unbiased recommendation, bypassing the computation of IPS. In order to efficiently balance the user distributions upon each item pair, we propose three strategies, including clipping, sampling and adversarial learning to improve the training process. For more robust optimization, we deploy an explicit model to capture the potential latent confounders in recommendation systems. To the best of our knowledge, this paper is the first work on debiased recommendation based on confounder balancing. In the experiments, we compare our framework with many state-of-the-art methods based on synthetic, semi-synthetic and real-world datasets. Extensive experiments demonstrate that our model is effective in promoting the recommendation performance.

下载PDF全文

下载文献需遵守相关版权规定

论文标题