论文标题
具有基于信息的强化学习的智能电表中的隐私成本管理
Privacy-Cost Management in Smart Meters with Mutual Information-Based Reinforcement Learning
论文作者
论文摘要
物联网(IoT)范式的快速发展和扩展已大大增加了传感器和系统之间数据的收集和交换,这种现象引起了某些领域的严重隐私问题。尤其是,智能电表(SMS)与公用事业提供商共享家庭用户消耗的细粒度消耗,因为数据通过数据泄漏,可能会侵犯用户的隐私。为了增强隐私,电力消费者可以利用物理资源的可用性,例如可充电电池(RB),以塑造其电力需求,如隐私成本管理单元(PCMU)所决定的。在本文中,我们提出了一种新的方法,可以使用深度强化学习(DRL)学习PCMU策略。我们在用户的需求负载和电网所看到的掩盖负载之间采用共同信息(MI)作为可靠的一般隐私措施。与以前的研究不同,我们对数据中的整个时间相关性进行了建模,以以其一般形式学习MI,并使用神经网络估算基于MI的奖励信号来指导PCMU学习过程。该方法与一种无模型的DRL算法结合使用,称为深Q学习方法(DDQL)方法。完整的DDQL-MI算法的性能使用实际的SMS数据集进行了经验评估,并将其与更简单的隐私度量进行了比较。我们的结果表明,与最新的隐私意识需求塑造方法相比,有了显着改善。
The rapid development and expansion of the Internet of Things (IoT) paradigm has drastically increased the collection and exchange of data between sensors and systems, a phenomenon that raises serious privacy concerns in some domains. In particular, Smart Meters (SMs) share fine-grained electricity consumption of households with utility providers that can potentially violate users' privacy as sensitive information is leaked through the data. In order to enhance privacy, the electricity consumers can exploit the availability of physical resources such as a rechargeable battery (RB) to shape their power demand as dictated by a Privacy-Cost Management Unit (PCMU). In this paper, we present a novel method to learn the PCMU policy using Deep Reinforcement Learning (DRL). We adopt the mutual information (MI) between the user's demand load and the masked load seen by the power grid as a reliable and general privacy measure. Unlike previous studies, we model the whole temporal correlation in the data to learn the MI in its general form and use a neural network to estimate the MI-based reward signal to guide the PCMU learning process. This approach is combined with a model-free DRL algorithm known as the Deep Double Q-Learning (DDQL) method. The performance of the complete DDQL-MI algorithm is assessed empirically using an actual SMs dataset and compared with simpler privacy measures. Our results show significant improvements over state-of-the-art privacy-aware demand shaping methods.