论文标题
从嘈杂的图形结构中学习节点表示
Learning Node Representations from Noisy Graph Structures
论文作者
论文摘要
事实证明,学习图表上的低维表示在各种下游任务中都是有效的。但是,在现实世界中,噪音在很大程度上损害了网络,因为网络中的边缘通过整个网络而不是节点本身传播噪声。尽管现有的方法倾向于保留结构属性,但学到的表示对噪声的鲁棒性通常被忽略。在本文中,我们提出了一个新颖的框架来学习无噪声节点表示并同时消除噪声。由于在真实图上通常未知噪声,因此我们设计了两个生成器,即图形生成器和噪声发生器,以在无监督的设置中识别正常的结构和噪声。一方面,图形生成器是统一的方案,以合并任何有用的图形知识以生成正常结构。我们用社区结构和幂律学位分布作为示例来说明生成过程。另一方面,噪声发生器会生成图形噪声,不仅可以满足某些基本属性,而且以自适应方式满足。因此,可以成功处理具有任意分布的真实声音。最后,为了消除噪声并获得无噪声节点表示,需要共同优化两个发电机,并通过最大似然估计,我们将模型分别在真实的图和噪声上分别将不同的正规化约束施加了不同的正规化约束。我们的模型对现实世界和合成数据均进行了评估。对于节点分类和图形重建任务,它的表现优于其他强大的基线,证明了其消除图噪声的能力。
Learning low-dimensional representations on graphs has proved to be effective in various downstream tasks. However, noises prevail in real-world networks, which compromise networks to a large extent in that edges in networks propagate noises through the whole network instead of only the node itself. While existing methods tend to focus on preserving structural properties, the robustness of the learned representations against noises is generally ignored. In this paper, we propose a novel framework to learn noise-free node representations and eliminate noises simultaneously. Since noises are often unknown on real graphs, we design two generators, namely a graph generator and a noise generator, to identify normal structures and noises in an unsupervised setting. On the one hand, the graph generator serves as a unified scheme to incorporate any useful graph prior knowledge to generate normal structures. We illustrate the generative process with community structures and power-law degree distributions as examples. On the other hand, the noise generator generates graph noises not only satisfying some fundamental properties but also in an adaptive way. Thus, real noises with arbitrary distributions can be handled successfully. Finally, in order to eliminate noises and obtain noise-free node representations, two generators need to be optimized jointly, and through maximum likelihood estimation, we equivalently convert the model into imposing different regularization constraints on the true graph and noises respectively. Our model is evaluated on both real-world and synthetic data. It outperforms other strong baselines for node classification and graph reconstruction tasks, demonstrating its ability to eliminate graph noises.