基于文本生成语言模型的股指预测

doi:10.15888/j.cnki.csa.009266

AIPUB归智期刊联盟

微信公众号

网站二维码

2025年4月18日 2:59 星期五

首页 > 过刊浏览>2023年第32卷第10期 >54-64. DOI:10.15888/j.cnki.csa.009266

PDF HTML阅读 XML下载导出引用引用提醒

基于文本生成语言模型的股指预测
DOI:
                        10.15888/j.cnki.csa.009266
                    
CSTR:
                        
                    
作者:
                        温灿红温灿红
中国科学技术大学 管理学院, 合肥 230026
在期刊界中查找
在百度中查找
在本站中查找
陈思陈思
中国科学技术大学 管理学院, 合肥 230026
在期刊界中查找
在百度中查找
在本站中查找
杨海生杨海生
中山大学 岭南学院, 广州 510275
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家自然科学基金面上项目(72173141); 广东省自然科学基金面上项目(2023A1515012434)

Stock Index Prediction with Text Generative Language Model

Author:

WEN Can-Hong
WEN Can-Hong
School of Management, University of Science and Technology of China, Hefei 230026, China
在期刊界中查找
在百度中查找
在本站中查找
CHEN Si
CHEN Si
School of Management, University of Science and Technology of China, Hefei 230026, China
在期刊界中查找
在百度中查找
在本站中查找
YANG Hai-Sheng
YANG Hai-Sheng
Lingnan College, Sun Yat-sen University, Guangzhou 510275, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

股指预测是金融领域中一个重要课题. 随着计算能力和技术的发展, 从在线新闻中识别和量化有价值的信息为提高股指预测表现创造了机会. 本文为将关于股票指数预测框架的计量经济学文献扩展到高维文本数据提出了一种基于生成语言模型的股票指数预测框架. 该预测框架可以分为两个步骤. 首先, 使用有监督生成语言模型快速过滤噪声词语, 并将剩余文本聚合成可以充分解释股指变动的新闻指数. 其次, 将该新闻指数和历史股指数据共同作为时变参数预测模型的自变量来预测股指未来价值. 该框架不仅丰富了股票指数预测的影响因素并且揭示了这些因素与股票指数价值之间的时变动态关系. 实证研究展示了该预测框架解释能力和样本外预测能力. 在预测的6个行业股指中, 本文提出的预测框架得到的均方误差普遍小于传统时间序列和机器学习方法. 与没有考虑新闻信息的时变参数预测模型和长短期记忆网络相比该预测框架也表现了更好的预测性能.

Abstract:

Stock index prediction is an important topic in the field of finance. With the development of computing power and technologies, there are opportunities to improve the performance of stock index prediction by identifying and quantifying valuable information from online news. In order to extend the econometric literature on stock index prediction frameworks to high-dimensional textual data, a stock index prediction framework based on generative language models is proposed. The prediction framework can be divided into two steps. First, a supervised generative language model is used to filter out noisy words quickly and aggregate the remaining text into a news index that can fully explain stock index changes. Second, the news index and historical stock index data are jointly used as independent variables of the time-varying parameter predictive model to predict future stock index values. The framework not only enriches the influencing factors of stock index prediction but also reveals the time-varying dynamic relationship between these factors and stock index values. Empirical research demonstrates the explanatory and out-of-sample predictive power of the proposed prediction framework. Among the six industrial stock indices predicted, the mean square error obtained by the proposed prediction framework is generally lower than that by traditional time series and machine learning methods. Compared with the time-varying parameter predictive model and long short-term memory model that do not consider news information, the proposed prediction framework also exhibits better predictive performance.

引用本文

温灿红,陈思,杨海生.基于文本生成语言模型的股指预测.计算机系统应用,2023,32(10):54-64

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2023-03-28
最后修改日期:2023-05-06
录用日期:
在线发布日期: 2023-08-09
出版日期:

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码

微信公众号

网站二维码

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码