Serverless ETL using AWS Glue

In this use case, we develop a sample data pipeline (a Glue job), defined with the AWS CDK in TypeScript, that reads data from a DynamoDB table, transforms it with PySpark, and writes the result to an S3 bucket in CSV format.

DynamoDB is a fully managed NoSQL database service offered by AWS that scales easily and is used across many applications. S3 is AWS's general-purpose object storage service.
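To make the shape change in this pipeline concrete, here is a minimal TypeScript sketch of turning DynamoDB-style items into CSV text. It is not the PySpark script the Glue job runs; it is a plain TypeScript stand-in for the same idea. The function names and the attribute names in the usage note are assumptions made for illustration.

```typescript
// DynamoDB's low-level JSON wraps each value in a type tag such as { S: "..." } or { N: "..." }.
type AttributeValue = { S: string } | { N: string } | { BOOL: boolean };
type DynamoItem = Record<string, AttributeValue>;

// Unwrap one tagged value into the plain string that goes into a CSV cell.
function unwrap(value: AttributeValue | undefined): string {
  if (value === undefined) return "";
  if ("S" in value) return value.S;
  if ("N" in value) return value.N;
  return String(value.BOOL);
}

// Quote a cell only when it contains a comma, a double quote, or a line break.
function escapeCsv(cell: string): string {
  return /[",\r\n]/.test(cell) ? `"${cell.replace(/"/g, '""')}"` : cell;
}

// Convert items to CSV text with a header row; missing attributes become empty cells.
function itemsToCsv(items: DynamoItem[], columns: string[]): string {
  const header = columns.map(escapeCsv).join(",");
  const rows = items.map((item) =>
    columns.map((col) => escapeCsv(unwrap(item[col]))).join(",")
  );
  return [header, ...rows].join("\n");
}
```

For example, calling `itemsToCsv` with one item holding hypothetical `id`, `name`, and `age` attributes and the column list `["id", "name", "age"]` yields a header line followed by one data line. In the real job, Spark does this work in a distributed way and writes the output files to S3.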

The cdk.json file tells the CDK Toolkit how to execute your app.

Useful commands

npm run build - compile TypeScript to JS
npm run watch - watch for changes and compile
npm run test - perform the Jest unit tests
cdk deploy - deploy this stack to your default AWS account/region
cdk diff - compare deployed stack with current state
cdk synth - emits the synthesized CloudFormation template

How to reproduce

Prerequisites - you should have an AWS account (the free tier is enough), and the AWS CLI should already be configured (https://docs.aws.amazon.com/cli/latest/userguide/cli-chap-configure.html)

Clone the repository
Bootstrap your AWS environment using - cdk bootstrap
Deploy the stack using - cdk deploy
Create dummy data in DynamoDB using the sample data
Run the Glue job from the AWS console
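For the dummy-data step, the following is a minimal sketch of generating items in the request format that DynamoDB's BatchWriteItem operation accepts. It is an alternative to the repository's sample data, not a copy of it: the attribute names (`id`, `name`, `score`) and the helper names are assumptions, and the table name must match whatever the deployed stack creates.

```typescript
type AttributeValue = { S: string } | { N: string };
type PutRequest = { PutRequest: { Item: Record<string, AttributeValue> } };

// BatchWriteItem accepts at most 25 put/delete requests per call.
const MAX_BATCH_SIZE = 25;

// Build `count` dummy items; the attribute names are made up for illustration.
function makeDummyItems(count: number): PutRequest[] {
  return Array.from({ length: count }, (_, i) => ({
    PutRequest: {
      Item: {
        id: { S: `user-${i + 1}` },
        name: { S: `Sample User ${i + 1}` },
        score: { N: String((i * 7) % 100) },
      },
    },
  }));
}

// Split requests into batches keyed by table name, the shape BatchWriteItem expects.
function toBatches(
  tableName: string,
  requests: PutRequest[]
): Record<string, PutRequest[]>[] {
  const batches: Record<string, PutRequest[]>[] = [];
  for (let i = 0; i < requests.length; i += MAX_BATCH_SIZE) {
    batches.push({ [tableName]: requests.slice(i, i + MAX_BATCH_SIZE) });
  }
  return batches;
}
```

Each batch can be serialized with `JSON.stringify`, saved to a file, and loaded with `aws dynamodb batch-write-item --request-items file://batch.json`.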

The Glue job can be configured in the CDK stack.
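As a rough illustration of what configuring the job involves, the sketch below builds a properties object for a Glue Spark job. It does not reproduce the stack in this repository. The `GlueJobProps` interface is a local mirror of a subset of the CDK's `CfnJobProps`, declared here so the example compiles without `aws-cdk-lib`; the job name, the custom argument names, and the default values are all assumptions.

```typescript
// Local mirror of a subset of CfnJobProps so the sketch needs no CDK dependency.
interface GlueJobProps {
  name: string;
  role: string; // IAM role name or ARN that the job assumes
  command: { name: string; scriptLocation: string; pythonVersion: string };
  glueVersion: string;
  workerType: string;
  numberOfWorkers: number;
  defaultArguments: Record<string, string>;
}

// Hypothetical options a stack might expose for tuning the job.
interface EtlJobOptions {
  roleArn: string;
  scriptS3Uri: string; // S3 location of the PySpark script
  sourceTable: string; // DynamoDB table to read
  outputS3Uri: string; // S3 prefix for the CSV output
  workers?: number;
}

function buildGlueJobProps(opts: EtlJobOptions): GlueJobProps {
  return {
    name: "dynamodb-to-s3-csv", // hypothetical job name
    role: opts.roleArn,
    command: {
      name: "glueetl", // Spark ETL job type
      scriptLocation: opts.scriptS3Uri,
      pythonVersion: "3",
    },
    glueVersion: "4.0", // assumed; pick the version the script targets
    workerType: "G.1X",
    numberOfWorkers: opts.workers ?? 2,
    defaultArguments: {
      "--job-language": "python",
      // Custom arguments with illustrative names; a script could read
      // them with getResolvedOptions from awsglue.utils.
      "--source_table": opts.sourceTable,
      "--output_path": opts.outputS3Uri,
    },
  };
}
```

In a CDK v2 stack, an object of this shape could be passed to `new glue.CfnJob(this, 'EtlJob', props)`, with `glue` imported from `aws-cdk-lib/aws-glue`. Changing the worker count or the script location then only requires editing the stack and running cdk deploy again.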

Demo

https://vimeo.com/677054610

Repository: arpan65/aws-serverless-etl
