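Deep learning can extract high-level abstract features from large volumes of raw data without relying on prior knowledge, which makes it potentially attractive for financial market forecasting. Based on the idea of "decomposition, reconstruction, and synthesis," this paper proposes a new deep learning forecasting methodology and builds on it a composite deep learning model for one-step-ahead stock market prediction, CEEMD-LSTM. In this model, CEEMD serves as the series-stationarization decomposition module: it separates fluctuations and trends at different scales from the time series level by level, producing a series of intrinsic mode functions (IMFs) with different characteristic scales. A long short-term memory (LSTM) network, a deep learning architecture well suited to time series, then extracts high-level deep features from each IMF and from the trend term separately and predicts the return on the next trading day's closing price. Finally, the predictions for all IMF components and the trend term are combined to give the final forecast. Empirical results on stock indices from three types of stock markets at different stages of development show that the model outperforms the reference models on both dimensions of prediction: forecast error and hit rate.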
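The decompose, forecast-per-component, and sum structure described in this abstract can be sketched in a few lines. This is only an illustration of the pipeline shape, not the paper's model: a cascade of moving averages stands in for CEEMD, a least-squares AR(1) fit stands in for each per-component LSTM, and every function name, window size, and sample value below is an assumption. The two helper metrics correspond to the two evaluation dimensions the abstract names (forecast error and hit rate); their exact definitions in the paper are not given here, so RMSE and sign agreement are assumed.

```python
import math

def trailing_mean(series, window):
    """Trailing moving average, using a shorter window at the start."""
    out = []
    for i in range(len(series)):
        chunk = series[max(0, i - window + 1):i + 1]
        out.append(sum(chunk) / len(chunk))
    return out

def decompose(series, windows=(3, 6)):
    """Stand-in for CEEMD (assumption): peel off detail layers, finest first.
    Returns additive components; the last entry plays the role of the trend term."""
    components, residual = [], list(series)
    for w in windows:
        smooth = trailing_mean(residual, w)
        components.append([r - s for r, s in zip(residual, smooth)])
        residual = smooth
    components.append(residual)
    return components

def forecast_component(component):
    """Stand-in for a per-component LSTM (assumption): least-squares AR(1)."""
    num = sum(a * b for a, b in zip(component[1:], component[:-1]))
    den = sum(b * b for b in component[:-1])
    phi = num / den if den else 0.0
    return phi * component[-1]

def predict_next(series):
    """One-step-ahead forecast: sum of the per-component forecasts."""
    return sum(forecast_component(c) for c in decompose(series))

def rmse(pred, actual):
    """Root mean squared error between predicted and realized returns."""
    return math.sqrt(sum((p - a) ** 2 for p, a in zip(pred, actual)) / len(actual))

def hit_rate(pred, actual):
    """Share of days on which predicted and realized returns share a sign."""
    return sum(1 for p, a in zip(pred, actual) if p * a > 0) / len(actual)

# Hypothetical daily returns, for illustration only.
returns = [0.012, -0.004, 0.007, 0.003, -0.009, 0.005, 0.001, -0.002, 0.006, -0.003]
parts = decompose(returns)
next_return = predict_next(returns)
```

Because the stand-in components are additive by construction, they sum back to the original series, which is the property that lets the per-component forecasts be added into one final forecast.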
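Stock market sentiment reflects investor behavior to some extent and influences investment decisions. Market news, as a form of unstructured data, both reflects and steers the overall market mood; together with stock prices, it is a crucial source of market reference data and can help investors make decisions. This paper proposes a method for accurately and quickly building multi-dimensional sentiment feature vectors from massive news data, uses a support vector machine (SVM) model to predict the impact of financial news on the stock market, and applies the bootstrap to mitigate overfitting. Experiments on the Shanghai and Shenzhen stock indices show that, compared with traditional models, the proposed method improves prediction accuracy by about 8% and earns an excess return of 6.52% in a three-month backtest, which demonstrates its effectiveness.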
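The abstract does not say how the bootstrap is applied. One common way to use it against overfitting is bagging: train one classifier per bootstrap resample and take a majority vote. The sketch below illustrates that idea only. It assumes bagging is a reasonable reading, swaps in a nearest-centroid classifier for the SVM so the example needs no external library, and uses made-up two-dimensional sentiment vectors with labels +1 (market up) and -1 (market down).

```python
import random

def bootstrap_sample(X, y, rng):
    """Draw len(X) rows with replacement."""
    idx = [rng.randrange(len(X)) for _ in X]
    return [X[i] for i in idx], [y[i] for i in idx]

def fit_centroids(X, y):
    """Stand-in base learner (NOT an SVM): one mean vector per class."""
    sums, counts = {}, {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for j, value in enumerate(row):
            acc[j] += value
        counts[label] = counts.get(label, 0) + 1
    return {lab: [s / counts[lab] for s in acc] for lab, acc in sums.items()}

def predict_centroids(model, row):
    """Return the label whose centroid is nearest in squared distance."""
    def dist2(centroid):
        return sum((a - b) ** 2 for a, b in zip(row, centroid))
    return min(model, key=lambda lab: dist2(model[lab]))

def fit_bagged(X, y, n_models=25, seed=0):
    """Train one base learner per bootstrap resample."""
    rng = random.Random(seed)
    models = []
    while len(models) < n_models:
        Xb, yb = bootstrap_sample(X, y, rng)
        if len(set(yb)) == len(set(y)):  # keep resamples that contain every class
            models.append(fit_centroids(Xb, yb))
    return models

def predict_bagged(models, row):
    """Majority vote across the bootstrap models."""
    votes = {}
    for model in models:
        label = predict_centroids(model, row)
        votes[label] = votes.get(label, 0) + 1
    return max(votes, key=votes.get)

# Hypothetical features: [positive-sentiment score, negative-sentiment score].
X = [[0.9, 0.1], [0.8, 0.2], [0.7, 0.1], [0.1, 0.9], [0.2, 0.8], [0.1, 0.7]]
y = [1, 1, 1, -1, -1, -1]
models = fit_bagged(X, y)
up_call = predict_bagged(models, [0.85, 0.1])
down_call = predict_bagged(models, [0.1, 0.95])
```

Averaging over resamples reduces the variance of the final vote, which is the usual argument for why bagging helps with overfitting; with an actual SVM the base learner would be replaced and the resampling loop kept.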
This paper carefully examines proxy variables for investor sentiment in the Chinese stock market and constructs an investor sentiment index indirectly from them. We use cross-correlation analysis to examine the lead-lag relationship between the proxy variables and the HS300 index. The results show that net added accounts (NAA), SSE share turnover (TURN), and the closed-end fund discount (CEFD) are leading variables for the stock market. The average first-day return of IPOs (RIPO) and the relative degree of active trading in the equity market (RDAT) are contemporaneous variables, while the number of IPOs (NIPO) is a lagging variable. Using each sentiment proxy at its most likely leading order, together with forward-selection stepwise regression, the empirical results on monthly stock returns show that the three leading proxy variables can be combined into a sentiment index. Out-of-sample tests show that this sentiment index has good and robust predictive power for the Chinese stock market.
Funding: Supported by the National Natural Science Foundation of China under Grant Nos. 71003004 and 71373001.
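The lead-lag screening by cross-correlation described in the English abstract above can be illustrated as follows. This is a minimal sketch, not the paper's procedure: it computes the Pearson correlation between a proxy shifted by k periods and the index, and reads the lag with the highest correlation as the lead (k > 0), contemporaneous (k = 0), or lag (k < 0) order. The function names and the toy series are assumptions; the index here is built by construction as the proxy delayed by two periods.

```python
import math

def pearson(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((u - mean_a) * (v - mean_b) for u, v in zip(a, b))
    var_a = sum((u - mean_a) ** 2 for u in a)
    var_b = sum((v - mean_b) ** 2 for v in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / math.sqrt(var_a * var_b)

def cross_correlation(proxy, index, max_lag):
    """Map each lag k to corr(proxy[t], index[t + k]).
    k > 0 means the proxy leads the index by k periods; k < 0 means it lags."""
    n = len(proxy)
    result = {}
    for k in range(-max_lag, max_lag + 1):
        if k >= 0:
            a, b = proxy[:n - k], index[k:]
        else:
            a, b = proxy[-k:], index[:n + k]
        result[k] = pearson(a, b)
    return result

# Toy monthly series (hypothetical): the index repeats the proxy two months later.
proxy = [1, 3, 2, 5, 4, 7, 3, 8, 6, 9, 2, 5]
index = [0, 0] + proxy[:-2]
cc = cross_correlation(proxy, index, max_lag=3)
best_lag = max(cc, key=cc.get)
```

In this toy case the correlation peaks at a lag of two, so the proxy would be classed as a leading variable of order two; a peak at zero would indicate a contemporaneous variable and a peak at a negative lag a lagging one.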