采样解释的生成对抗性推荐剂

论文标题

采样解释的生成对抗性推荐剂

Sampling-Decomposable Generative Adversarial Recommender

论文作者

Jin, Binbin, Lian, Defu, Liu, Zheng, Liu, Qi, Ma, Jianhui, Xie, Xing, Chen, Enhong

论文摘要

建议技术是减轻信息过载的重要方法。由于缺乏明确的负面样本，经常受到隐性用户反馈的培训，因此许多推荐人遭受了稀疏挑战。 GAN风格的推荐人（即Irgan）通过学习生成器和歧视器对手来应对挑战，从而使发电机生成越来越困难的样本，以使歧视器加速优化歧视目标。但是，从发电机中产生样品非常耗时，我们的经验研究表明，歧视器在TOP-K项目的建议中表现较差。为此，对GAN式算法进行了理论分析，表明极限能力的发生器与最佳发生器有所不同。这可能会解释歧视者表现的局限性。基于这些发现，我们提出了一种可取消分解的可生成的对抗推荐剂（SD-GAR）。在框架中，某些发电机和最佳之间的差异是通过自称的重要性抽样来补偿的。用采样解释的发生器提高了样品产生的效率，以便可以使用Vose-Alias方法在O（1）中生成每个样品。有趣的是，由于采样的可分解性，可以用封闭形式的解决方案以交替的方式优化发电机，这与GAN风格算法中的策略梯度不同。我们使用五个现实世界的推荐数据集对提出的算法进行了广泛的评估。结果表明，SD-GAR的表现优于Irgan 12.4％，而SOTA推荐的人平均超过10％。此外，在数据集上，具有超过12万个项目的歧视训练可以快20倍。

Recommendation techniques are important approaches for alleviating information overload. Being often trained on implicit user feedback, many recommenders suffer from the sparsity challenge due to the lack of explicitly negative samples. The GAN-style recommenders (i.e., IRGAN) addresses the challenge by learning a generator and a discriminator adversarially, such that the generator produces increasingly difficult samples for the discriminator to accelerate optimizing the discrimination objective. However, producing samples from the generator is very time-consuming, and our empirical study shows that the discriminator performs poor in top-k item recommendation. To this end, a theoretical analysis is made for the GAN-style algorithms, showing that the generator of limit capacity is diverged from the optimal generator. This may interpret the limitation of discriminator's performance. Based on these findings, we propose a Sampling-Decomposable Generative Adversarial Recommender (SD-GAR). In the framework, the divergence between some generator and the optimum is compensated by self-normalized importance sampling; the efficiency of sample generation is improved with a sampling-decomposable generator, such that each sample can be generated in O(1) with the Vose-Alias method. Interestingly, due to decomposability of sampling, the generator can be optimized with the closed-form solutions in an alternating manner, being different from policy gradient in the GAN-style algorithms. We extensively evaluate the proposed algorithm with five real-world recommendation datasets. The results show that SD-GAR outperforms IRGAN by 12.4% and the SOTA recommender by 10% on average. Moreover, discriminator training can be 20x faster on the dataset with more than 120K items.

下载PDF全文

下载文献需遵守相关版权规定

论文标题