Implementation of a Random forest algorithm model to predict stock prices for the S&P 500 index.
Here's a quick walkthrough on what to expect to do in this project:)
- Data Preprocessing
- Load data using yfinance library and data cleaning techniques.
- Create a target column for training the model
- Model Training
- Create Random Forrest Regressor model with
- Tune parameters for increased accuracy on predictions
- Train the model using predefined training set and continiously evalute for accuracy score, f1 score, and classification reports
- Backtesting
- Develop a backtesting algorithm to evaluate model performance using large historical data (10+ years)
- Gather data from backtesting process for evaluation using the earlier mentioned metrics
- Model Improvement
- Create new model parameters to improve the performace of the model
- Create new set of predictor variables to increase reliability and accuracy
- Reporting
- Report model performance and reasoning behind improvement techniques adopted