论文标题
加强学习应用于交易系统:一项调查
Reinforcement Learning Applied to Trading Systems: A Survey
论文作者
论文摘要
金融领域的任务,例如在市场交易中进行交易,具有挑战性,长期以来吸引了研究人员。最近的成就和随之而来的强化学习(RL)也增加了其在交易任务中的采用。 RL使用具有完善的正式概念的框架,这提高了其在学习有利可图的交易策略方面的吸引力。但是,在财务领域没有适当关注的RL使用可以防止新研究人员遵循标准或未能采用相关的概念准则。在这项工作中,我们涵盖了开创性的RL技术基础,概念和建议,以进行统一的,理论上的研究和对先前研究的比较,这些研究可以作为研究领域的结构指南。根据我们的分类,对29篇文章进行了审查,该文章考虑了RL最常见的配方和设计模式,这些模式是通过大量可用研究的。此分类允许精确检查有关数据输入,预处理,状态和动作组成,采用的RL技术,评估设置和总体结果的最相关方面。我们围绕基本RL概念组织的分析方法,可以清楚地识别当前系统设计的最佳实践,需要进一步研究的差距以及有希望的研究机会。最后,这篇综述试图通过促进研究人员对标准遵守的承诺,并帮助他们避免偏离RL Constructs的牢固基础,来促进这一研究领域的发展。
Financial domain tasks, such as trading in market exchanges, are challenging and have long attracted researchers. The recent achievements and the consequent notoriety of Reinforcement Learning (RL) have also increased its adoption in trading tasks. RL uses a framework with well-established formal concepts, which raises its attractiveness in learning profitable trading strategies. However, RL use without due attention in the financial area can prevent new researchers from following standards or failing to adopt relevant conceptual guidelines. In this work, we embrace the seminal RL technical fundamentals, concepts, and recommendations to perform a unified, theoretically-grounded examination and comparison of previous research that could serve as a structuring guide for the field of study. A selection of twenty-nine articles was reviewed under our classification that considers RL's most common formulations and design patterns from a large volume of available studies. This classification allowed for precise inspection of the most relevant aspects regarding data input, preprocessing, state and action composition, adopted RL techniques, evaluation setups, and overall results. Our analysis approach organized around fundamental RL concepts allowed for a clear identification of current system design best practices, gaps that require further investigation, and promising research opportunities. Finally, this review attempts to promote the development of this field of study by facilitating researchers' commitment to standards adherence and helping them to avoid straying away from the RL constructs' firm ground.