论文标题

以基于Q学习的动态定价,前景理论启发的自动化P2P能源交易

Prospect Theory-inspired Automated P2P Energy Trading with Q-learning-based Dynamic Pricing

论文作者

Timilsina, Ashutosh, Silvestri, Simone

论文摘要

分布式能源的广泛采用以及智能电网技术的出现使传统上被动的电力系统用户能够积极参与能源交易。认识到传统的集中式电网驱动能源市场为这些用户提供最低利润的事实,最近的研究将注意力转移到了分散的点对点(P2P)能源市场上。在这些市场中,用户彼此交易能源,比买卖网格的收益更高。但是,假设持续的可用性,参与和完全合规性,大多数P2P能源交易中的研究很大程度上忽略了交易过程中用户的看法。结果,这些方法可能会导致负面的态度和随着时间的推移参与度的减少。在本文中,我们设计了一个自动化的P2P能源市场,该市场将用户感知考虑在内。我们采用前景理论来对用户的看法进行建模并制定优化框架,以最大程度地提高买方的看法,同时匹配需求和生产。鉴于优化问题的非线性和非凸性性质,我们提出了基于差异进化的算法,用于交易能源,称为辩论。此外,我们引入了一种具有风险敏感的Q学习算法,该算法用Q学习和风险敏感性(PQR)称为定价机制,该算法(PQR)了解了考虑其感知到的实用程序的卖方的最佳价格。基于真正的能源消耗和生产的实际痕迹以及现实的前景理论的功能,表明我们的方法可为买家带来26%的感知价值,并且与最近的最新方法相比,卖方的奖励增加了7%。

The widespread adoption of distributed energy resources, and the advent of smart grid technologies, have allowed traditionally passive power system users to become actively involved in energy trading. Recognizing the fact that the traditional centralized grid-driven energy markets offer minimal profitability to these users, recent research has shifted focus towards decentralized peer-to-peer (P2P) energy markets. In these markets, users trade energy with each other, with higher benefits than buying or selling to the grid. However, most researches in P2P energy trading largely overlook the user perception in the trading process, assuming constant availability, participation, and full compliance. As a result, these approaches may result in negative attitudes and reduced engagement over time. In this paper, we design an automated P2P energy market that takes user perception into account. We employ prospect theory to model the user perception and formulate an optimization framework to maximize the buyer's perception while matching demand and production. Given the non-linear and non-convex nature of the optimization problem, we propose Differential Evolution-based Algorithm for Trading Energy called DEbATE. Additionally, we introduce a risk-sensitive Q-learning algorithm, named Pricing mechanism with Q-learning and Risk-sensitivity (PQR), which learns the optimal price for sellers considering their perceived utility. Results based on real traces of energy consumption and production, as well as realistic prospect theory functions, show that our approach achieves a 26% higher perceived value for buyers and generates 7% more reward for sellers, compared to a recent state of the art approach.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源