论文标题
自我监督的图表学习推荐
Self-supervised Graph Learning for Recommendation
论文作者
论文摘要
用于推荐的用户项目图上的表示学习已经从使用单个ID或交互历史记录到利用高阶邻居的发展发展。这导致了图形卷积网络(GCN)的成功,例如Pinsage和LightGCN。尽管有效,我们认为它们遭受了两个局限性:(1)高度节点对代表学习产生更大的影响,从而恶化了低度(长尾)项目的建议; (2)代表很容易受到嘈杂的相互作用,因为邻里聚合方案进一步扩大了观察到的边缘的影响。 在这项工作中,我们探索了在用户项目图上的自我监督学习,以提高GCN的准确性和鲁棒性以供推荐。这个想法是通过辅助自我监督的任务来补充经典的监督建议,从而通过自我歧视加强节点表示学习。具体而言,我们生成节点的多个视图,与其他节点相比,最大化同一节点的不同视图之间的一致性。我们设计三个操作员来生成视图 - 节点辍学,边缘辍学和随机步行 - 以不同的方式改变图形结构。我们将此新的学习范式称为\ textit {自我监督的图形学习}(SGL),在最新的模型LightGCN上实现。通过理论分析,我们发现SGL具有自动开采硬否负面因素的能力。对三个基准数据集的实证研究证明了SGL的有效性,该研究提高了建议准确性,尤其是在长尾项目上,以及针对相互作用噪声的鲁棒性。我们的实现可在\ url {https://github.com/wujcan/sgl}上获得。
Representation learning on user-item graph for recommendation has evolved from using single ID or interaction history to exploiting higher-order neighbors. This leads to the success of graph convolution networks (GCNs) for recommendation such as PinSage and LightGCN. Despite effectiveness, we argue that they suffer from two limitations: (1) high-degree nodes exert larger impact on the representation learning, deteriorating the recommendations of low-degree (long-tail) items; and (2) representations are vulnerable to noisy interactions, as the neighborhood aggregation scheme further enlarges the impact of observed edges. In this work, we explore self-supervised learning on user-item graph, so as to improve the accuracy and robustness of GCNs for recommendation. The idea is to supplement the classical supervised task of recommendation with an auxiliary self-supervised task, which reinforces node representation learning via self-discrimination. Specifically, we generate multiple views of a node, maximizing the agreement between different views of the same node compared to that of other nodes. We devise three operators to generate the views -- node dropout, edge dropout, and random walk -- that change the graph structure in different manners. We term this new learning paradigm as \textit{Self-supervised Graph Learning} (SGL), implementing it on the state-of-the-art model LightGCN. Through theoretical analyses, we find that SGL has the ability of automatically mining hard negatives. Empirical studies on three benchmark datasets demonstrate the effectiveness of SGL, which improves the recommendation accuracy, especially on long-tail items, and the robustness against interaction noises. Our implementations are available at \url{https://github.com/wujcan/SGL}.