Skip to content

Latest commit

 

History

History
189 lines (142 loc) · 8.74 KB

README.md

File metadata and controls

189 lines (142 loc) · 8.74 KB

SC1015 Data Science Project — Crypto-Genie

This project aims to effectively mitigate the volatility of cryptocurrency investment by evaluating the media hype from Google Trends in our prediction models.


Project Cover


🛠️ Installation and Set Up

  • Clone repository

    git clone https://github.com/crystalcheong/crypto-genie.git
    
  • Install dependencies with pip

    $ pip install -r requirements.txt
    

📂 Project Structure
📦crypto-genie
 ┣ 📂data
 ┃ ┣ 📂searchTrends
 ┃ ┣ 📜BTC-SearchTrend.csv
 ┃ ┗ 📜README.md
 ┣ 📂metrics
 ┃ ┣ 📜README.md
 ┣ 📂models
 ┣ 📜0_DataScraper.ipynb
 ┣ 📜1_DataAnalysis.ipynb
 ┣ 📜2_UnivariateForecast.ipynb
 ┣ 📜3_MultivariateForecast.ipynb
 ┣ 📜README.md
 ┗ 📜requirements.txt

/data - stores all the collected data to be utilized
/metrics - contains the exported measurement of accuracy & efficacy
/models - contains the exported pre-trained models


📑 Data Sources


🧭 In-depth Documentation


📚 Notebooks Overview

Each notebook is prefixed with the chronological order of the analysis pipeline and can be executed as a standalone.


🗝️ Project Takeaways

In conclusion, the inclusion of media hype from Google Trends resulted in a more accurate multivariate forecasting. That said, as the Google Trends search percentile is not the only definitive measurement of media hype, the accuracy and efficacy of the prediction models can be further enhanced by the integration of a multi-faceted data curation from other sources such as the users' sentiments from Reddit and Twitter

  • Learning Outcomes
    • Interpret stock information
    • Develop & evaluate time series forecasting machine learning models such as
      • Rolling-Forecast ARIMA
      • XGBRegressor
      • LSTM
    • Utilized external Python libraries such as plotly, statsmodels and tensorflow
    • Collaboration on Google Colab and git repository management with Github

🧰 Languages & Tools

  • Languages
    Python

  • Libraries, Packages
    Numpy Pandas Scikit Learn TensorFlow Plotly

  • Tools, IDE
    Github Jupyter Google Colab


Contributors ✨


Crystal Cheong


Yue Chong


Jared Chan

  • Data Collection & Preparation
  • Data Analysis
  • Machine Learning
  • Repository Documentation
  • Presentation Slides & Script
  • Data Analysis
  • Data Visualisations
  • Presentation Slides & Script
  • Video Presenter
  • Machine Learning
  • Presentation Slides & Script
  • Video Presenter

  • 💡 References

    This repository is submitted as a project work for Nanyang Technological University's SC1015- Data Science and Artificial Intelligence course.