Machine Learning Approaches to Picking A-Shares Stocks: A Comparative Analysis

Wenjun Wu, Jia You

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

Abstract

This study explores the integration of advanced machine learning (ML) techniques and large language models (LLMs) in financial modeling, focusing on the Chinese stock market. It introduces the ChatGPT Score, an LLM-driven sentiment analysis factor, and compares the traditional Fama-French five-factor (FF5) model with its augmented version, FF5+ChatGPT Score. The research evaluates linear regression models against ML models, such as Random Forests, Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Category Boosting (CatBoost), within five- and six-factor frameworks. Empirical results show that the ChatGPT Score outperforms traditional sentiment tools like SnowNLP and improves the predictive accuracy of the FF5 model. Additionally, CatBoost and Random Forests demonstrate strong portfolio management capabilities. Statistical validation through retrospective analysis confirms the effectiveness of the models, while industry feedback highlights their practical value in investment strategies. However, the study acknowledges the limitations of current models and recommends future research on deep learning techniques to improve financial market analysis and predictive accuracy.
Original languageEnglish
Title of host publicationProceedings of 2025 Joint International Conference on Automation-Intelligence-Safety&International Symposium on Autonomous Systems
Number of pages8
Publication statusPublished - May 2025
Event2025 Joint International Conference on Automation-Intelligence-Safety&International Symposium on Autonomous Systems - Xian, China
Duration: 23 May 202525 May 2025
https://docs.qq.com/sheet/DVGdwaE94bnN6cktP

Publication series

NameProceedings of 2025 Joint International Conference on Automation-Intelligence-Safety&International Symposium on Autonomous Systems

Conference

Conference2025 Joint International Conference on Automation-Intelligence-Safety&International Symposium on Autonomous Systems
Abbreviated title2025 ICAIS&ISAS
Country/TerritoryChina
CityXian
Period23/05/2525/05/25
Internet address

Keywords

  • Chinese Stock Market
  • Financial Modeling
  • Fama-French Models
  • Machine Learning
  • Random Forests
  • XGBoost
  • LightGBM
  • CatBoost, Large Language Models
  • Sentiment Analysis
  • ChatGPT Score

Fingerprint

Dive into the research topics of 'Machine Learning Approaches to Picking A-Shares Stocks: A Comparative Analysis'. Together they form a unique fingerprint.

Cite this