Deep Reinforcement Learning Portfolio Model Based on Dynamic Selectors


Abstract: In recent years, portfolio management has been studied extensively in the field of artificial intelligence, but existing quantitative trading methods based on deep learning still have shortcomings. First, the stock prediction mode is limited: a single model usually trains only one trading expert, and trading decisions rely solely on that model's predictions. Second, the data sources are relatively narrow: models consider only each stock's own data and ignore the effect of overall market risk on the stock. To address these problems, a reinforcement learning model based on dynamically selected predictors (DSDRL) is proposed. The model consists of three parts. First, features are extracted from the stock data and passed to multiple predictors; separate prediction models are trained for different investment strategies, and a dynamic selector picks the currently optimal prediction. Second, a market environment evaluation module quantifies the current market risk to obtain an appropriate proportion of capital to invest. Finally, building on these two modules, a deep reinforcement learning model simulates the real trading environment and derives the actual portfolio strategy from the predictions and the investment proportion. The model is tested on daily candlestick (K-line) data from the CSI 500 and the S&P 500. The results show that it outperforms the other reference models on the Sharpe ratio and other metrics.
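The three-stage flow described in the abstract can be pictured with a small sketch. This is not the authors' implementation: the abstract does not state how the dynamic selector ranks predictors or how the reinforcement learning agent allocates capital. The sketch assumes the selector picks the predictor with the lowest recent mean error, and it substitutes a plain equal-weight top-k rule for the deep reinforcement learning agent. The function names `select_predictor`, `allocate`, and `sharpe_ratio`, and all sample numbers, are hypothetical.

```python
import math
from statistics import mean, stdev


def select_predictor(recent_errors):
    """Return the index of the predictor with the lowest mean recent error.

    Assumption: 'currently optimal' means lowest recent error; the abstract
    does not specify the selection criterion.
    """
    return min(range(len(recent_errors)), key=lambda i: mean(recent_errors[i]))


def allocate(scores, invest_ratio, top_k=2):
    """Split `invest_ratio` of capital equally over the top_k scored stocks.

    Stand-in for the DRL agent (assumption). Index 0 of the result is the
    cash weight, 1 - invest_ratio; indices 1..n are the stock weights.
    """
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    chosen = ranked[:top_k]
    weights = [0.0] * (len(scores) + 1)
    weights[0] = 1.0 - invest_ratio
    for i in chosen:
        weights[i + 1] = invest_ratio / len(chosen)
    return weights


def sharpe_ratio(daily_returns, risk_free_daily=0.0, periods_per_year=252):
    """Annualized Sharpe ratio: mean excess return over its sample std dev."""
    excess = [r - risk_free_daily for r in daily_returns]
    return mean(excess) / stdev(excess) * math.sqrt(periods_per_year)


# Hypothetical inputs: three predictors, three stocks.
recent_errors = [[0.02, 0.03], [0.01, 0.01], [0.05, 0.02]]
predictions = [
    [0.00, 0.01, -0.01],
    [0.01, -0.02, 0.03],
    [0.02, 0.02, 0.02],
]
best = select_predictor(recent_errors)        # stage 1: dynamic selection
invest_ratio = 0.6                            # stage 2: assumed risk-module output
weights = allocate(predictions[best], invest_ratio)  # stage 3: allocation
```

With these inputs the second predictor is selected, and 60% of capital is split between the first and third stocks while 40% stays in cash. `sharpe_ratio` can then be applied to the resulting daily portfolio returns to compare strategies on the metric the abstract reports.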

Authors: ZHAO Miao; XIE Liang; LIN Wenjing; XU Haijiao (College of Science, Wuhan University of Technology, Wuhan 430070, China; School of Computer Science, Guangdong University of Education, Guangzhou 510303, China)

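Affiliations: College of Science, Wuhan University of Technology; School of Computer Science, Guangdong University of Education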

Source: Computer Science (计算机科学), indexed in CSCD and the Peking University Core Journals list, 2024, Issue 4, pp. 344-352 (9 pages)

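Funding: Natural Science Foundation of Guangdong Province (2020A1515011208); Guangzhou Basic Research Program, Basic and Applied Basic Research Project (202102080353); Characteristic Innovation Project in Natural Science for Regular Universities of Guangdong Province (2019KTSCX117).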

Keywords: Reinforcement learning; LSTM; Investment portfolio; Stock market forecast; Neural networks

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]

