成本敏感的投资组合通过深度加强学习选择

论文标题

成本敏感的投资组合通过深度加强学习选择

Cost-Sensitive Portfolio Selection via Deep Reinforcement Learning

论文作者

Zhang, Yifan, Zhao, Peilin, Wu, Qingyao, Li, Bin, Huang, Junzhou, Tan, Mingkui

论文摘要

投资组合选择是一项重要的现实财务任务，并在人工智能社区中引起了广泛的关注。但是，这项任务有两个主要困难：（i）非平稳价格序列和复杂的资产相关性使得对特征表示的学习非常困难；（ii）金融市场的实用性原则需要控制交易和风险成本。大多数现有方法采用手工艺功能和/或不考虑成本的限制，这可能会使它们的性能不令人满意，并且无法控制实践中的这两个费用。在本文中，我们提出了一种具有深入强化学习的成本敏感投资组合选择方法。具体而言，设计了一种新颖的两流投资组合策略网络，以提取价格序列模式和资产相关性，而开发了一种新的成本敏感奖励功能，以最大程度地提高累计回报，并通过增强学习来限制这两种成本。我们从理论上分析了拟议奖励的近距离观点，这表明有关此奖励功能的政策的增长率可以接近理论上的最佳效果。我们还在实证上评估了现实世界数据集的建议方法。有希望的结果证明了拟议方法在盈利能力，成本敏感性和表示能力方面的有效性和优势。

Portfolio Selection is an important real-world financial task and has attracted extensive attention in artificial intelligence communities. This task, however, has two main difficulties: (i) the non-stationary price series and complex asset correlations make the learning of feature representation very hard; (ii) the practicality principle in financial markets requires controlling both transaction and risk costs. Most existing methods adopt handcraft features and/or consider no constraints for the costs, which may make them perform unsatisfactorily and fail to control both costs in practice. In this paper, we propose a cost-sensitive portfolio selection method with deep reinforcement learning. Specifically, a novel two-stream portfolio policy network is devised to extract both price series patterns and asset correlations, while a new cost-sensitive reward function is developed to maximize the accumulated return and constrain both costs via reinforcement learning. We theoretically analyze the near-optimality of the proposed reward, which shows that the growth rate of the policy regarding this reward function can approach the theoretical optimum. We also empirically evaluate the proposed method on real-world datasets. Promising results demonstrate the effectiveness and superiority of the proposed method in terms of profitability, cost-sensitivity and representation abilities.

下载PDF全文

下载文献需遵守相关版权规定

论文标题