论文标题
通过贝叶斯标签过渡流动在线社交网络
Deperturbation of Online Social Networks via Bayesian Label Transition
论文作者
论文摘要
在线社交网络(OSN)根据其在线活动和兴趣将用户分为不同的类别,该任务称为节点分类任务。可以使用图形卷积网络(GCN)有效地解决此类任务。但是,少数用户(所谓的扰动器)可能会在OSN上进行随机活动,这大大恶化了基于GCN的基于GCN的节点分类任务。朝这个方向的现有作品是通过对抗训练或通过识别攻击者节点随后将其删除来捍卫GCN的。但是,这两种方法都要求首先确定攻击模式或攻击者节点,这在扰动器节点数量很小时很难。在这项工作中,我们开发了GCN防御模型,即使用标签过渡的概念。 Graphlt假定扰动者的随机活动会恶化GCN的性能。为了克服这个问题,GraphLT随后使用了一种新型的贝叶斯标签过渡模型,该模型采用GCN的预测标签,并通过基于Gibbs-Sampling的推理应用标签过渡,从而修复GCN的预测以实现更好的节点分类。在七个基准数据集上进行的大量实验表明,GraphLT在不受干扰的环境中大大提高了节点分类器的性能。此外,它验证了GraphLT可以成功地修复具有比几种竞争方法的性能优越的基于GCN的节点分类器。
Online social networks (OSNs) classify users into different categories based on their online activities and interests, a task which is referred as a node classification task. Such a task can be solved effectively using Graph Convolutional Networks (GCNs). However, a small number of users, so-called perturbators, may perform random activities on an OSN, which significantly deteriorate the performance of a GCN-based node classification task. Existing works in this direction defend GCNs either by adversarial training or by identifying the attacker nodes followed by their removal. However, both of these approaches require that the attack patterns or attacker nodes be identified first, which is difficult in the scenario when the number of perturbator nodes is very small. In this work, we develop a GCN defense model, namely GraphLT, which uses the concept of label transition. GraphLT assumes that perturbators' random activities deteriorate GCN's performance. To overcome this issue, GraphLT subsequently uses a novel Bayesian label transition model, which takes GCN's predicted labels and applies label transitions by Gibbs-sampling-based inference and thus repairs GCN's prediction to achieve better node classification. Extensive experiments on seven benchmark datasets show that GraphLT considerably enhances the performance of the node classifier in an unperturbed environment; furthermore, it validates that GraphLT can successfully repair a GCN-based node classifier with superior performance than several competing methods.