This repository has been archived by the owner on Apr 2, 2022. It is now read-only.

Design data models, build data warehouses and data lakes, automate data pipelines, and work with massive datasets.


piushvaish/data-engineering-capstone-project


Project Summary

The project follows these steps:

Step 1: Scope the Project and Gather Data
Step 2: Explore and Assess the Data
Step 3: Define the Data Model
Step 4: Run ETL to Model the Data
Step 5: Complete Project Write Up

Technology Used

  1. Pandas
  2. PySpark
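
To make Steps 2 to 4 concrete, here is a minimal sketch in Pandas of what exploring, cleaning, and modeling a dataset might look like. It is not the project's actual pipeline: the sample records, the column names (`visitor_id`, `country`, `arrival_date`), and the helper functions `explore`, `clean`, and `build_model` are all hypothetical, chosen only for illustration. The sketch uses Pandas alone so it runs without a Spark installation.

```python
import pandas as pd

# Hypothetical raw records; column names and values are illustrative only.
raw = pd.DataFrame(
    {
        "visitor_id": [1, 2, 2, 3],
        "country": ["IE", "IN", "IN", None],
        "arrival_date": ["2016-04-01", "2016-04-02", "2016-04-02", "2016-04-03"],
    }
)


def explore(df):
    """Step 2: count missing values per column and fully duplicated rows."""
    return {
        "missing": df.isna().sum().to_dict(),
        "duplicates": int(df.duplicated().sum()),
    }


def clean(df):
    """Step 2 (continued): drop duplicate rows and rows with missing values, parse dates."""
    out = df.drop_duplicates().dropna().copy()
    out["arrival_date"] = pd.to_datetime(out["arrival_date"])
    return out


def build_model(df):
    """Steps 3 and 4: split the cleaned data into a dimension table and a fact table."""
    dim_country = df[["country"]].drop_duplicates().reset_index(drop=True)
    dim_country["country_key"] = dim_country.index  # surrogate key (assumed design)
    fact = df.merge(dim_country, on="country")[
        ["visitor_id", "country_key", "arrival_date"]
    ]
    return dim_country, fact


report = explore(raw)
dim_country, fact_arrivals = build_model(clean(raw))
```

Splitting the work into small functions keeps each step testable on its own; for larger datasets the same explore, clean, and model sequence could be expressed with PySpark DataFrames instead.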
