This repository contains my submission for the Final Assignment Project: Working with NoSQL Databases. The original files were provided by the IBM Skills Network as part of the Introduction to NoSQL Databases course on Coursera. I have made modifications to fulfill the project requirements.
You are welcome to use this repository as a reference or starting point for your own project.
If you choose to fork this repository, please ensure that you comply with the terms of the Apache License and give proper credit to the original authors.
I am a data engineer at a data analytics consulting company that takes pride in its ability to handle data efficiently in any format on any database platform. The analysts in our offices rely on me to move data from external sources into various databases, transfer data between different types of databases, and run basic queries across those databases. It's a critical role that lets our analysts focus on analyzing data without worrying about the technical details of data movement and management.
- Replicate a Cloudant database
- Create indexes on a Cloudant database
- Query data in a Cloudant database
- Import data into a MongoDB database
- Query data in a MongoDB database
- Export data from MongoDB
- Import data into a Cassandra database
- Query data in a Cassandra database
- Task 1 - Replicate a local database into your Cloudant instance.
- Task 2 - Create an index for the `Director` key on the `movies` database using the HTTP API.
- Task 3 - Write a query to find all movies directed by `Richard Gage` using the HTTP API.
- Task 4 - Create an index for the `title` key on the `movies` database using the HTTP API.
- Task 5 - Write a query to list only the `year` and `Director` keys for the `Top Dog` movie using the HTTP API.
- Task 6 - Export the data from the `movies` database into a file named `movies.json`.
- Task 7 - Import `movies.json` into the MongoDB server, into a database named `entertainment` and a collection named `movies`.
- Task 8 - Write a MongoDB query to find the year in which the most movies were released.
- Task 9 - Write a MongoDB query to find the count of movies released after the year 1999.
- Task 10 - Write a query to find the average votes for movies released in 2007.
- Task 11 - Export the fields `_id`, `title`, `year`, `rating` and `director` from the `movies` collection into a file named `partial_data.csv`.
- Task 12 - Import `partial_data.csv` into the Cassandra server, into a keyspace named `entertainment` and a table named `movies`.
- Task 13 - Write a CQL query to count the number of rows in the `movies` table.
- Task 14 - Create an index for the `rating` column in the `movies` table using CQL.
- Task 15 - Write a CQL query to count the number of movies that are rated 'G'.
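
To make the query tasks more concrete, here is a minimal Python sketch of the request bodies and statements they might use. It is illustrative only, not the course's reference solution. The helper names (`index_body`, `find_body`, `post_json`), the `Votes` field name, the `rating_index` index name, and a numeric `year` field are all assumptions; Cloudant authentication is left out, so `post_json` would still need credentials added before it could be used against a real instance.

```python
import json
import urllib.request

# --- Cloudant, Tasks 2-5: bodies for the HTTP API ---------------------------

def index_body(field):
    """Body for POST /<db>/_index: a JSON index on one field."""
    return {"index": {"fields": [field]}, "type": "json"}

def find_body(selector, fields=None):
    """Body for POST /<db>/_find; `fields` limits the keys returned."""
    body = {"selector": selector}
    if fields:
        body["fields"] = fields
    return body

def post_json(url, body):
    """Hypothetical helper: POST a JSON body and decode the JSON reply.
    Authentication is omitted and must be added for a real instance."""
    request = urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)

director_index = index_body("Director")                           # Task 2
by_director = find_body({"Director": "Richard Gage"})             # Task 3
title_index = index_body("title")                                 # Task 4
top_dog = find_body({"title": "Top Dog"}, ["year", "Director"])   # Task 5

# --- MongoDB, Tasks 8-10: aggregation pipelines and a filter ----------------

# Task 8: count movies per year, sort by count descending, keep the top year.
busiest_year_pipeline = [
    {"$group": {"_id": "$year", "movie_count": {"$sum": 1}}},
    {"$sort": {"movie_count": -1}},
    {"$limit": 1},
]

# Task 9: filter for movies released after 1999 (assumes a numeric `year`).
released_after_1999 = {"year": {"$gt": 1999}}

# Task 10: average votes for 2007; the `Votes` field name is an assumption.
average_votes_2007 = [
    {"$match": {"year": 2007}},
    {"$group": {"_id": "$year", "average_votes": {"$avg": "$Votes"}}},
]

# --- Cassandra, Tasks 13-15: CQL statements ---------------------------------

count_rows = "SELECT COUNT(*) FROM entertainment.movies;"
# The index name `rating_index` is an assumption.
create_rating_index = (
    "CREATE INDEX IF NOT EXISTS rating_index ON entertainment.movies (rating);"
)
# Filtering on `rating` relies on the index created in Task 14.
count_g_rated = "SELECT COUNT(*) FROM entertainment.movies WHERE rating = 'G';"
```

With the `pymongo` driver, the pipelines would typically be passed to `collection.aggregate(...)` and the filter to `collection.count_documents(...)`. With the DataStax `cassandra-driver`, the CQL strings would go to `session.execute(...)`. The Cloudant bodies can equally be sent with `curl` instead of `post_json`.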
Before proceeding, complete the steps in the How to Create an IBM Cloudant Instance document to create a free multi-tenant Cloudant instance.
Note: This repo can also be used for local NoSQL database migration and analysis. You must have MongoDB Server and Apache Cassandra already installed. Refer to the links below:
Start the MongoDB and Cassandra processes according to your environment.
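
Once both processes are started, a quick way to confirm they are accepting connections is to probe their TCP ports. The sketch below assumes the default ports (27017 for MongoDB, 9042 for Cassandra's CQL native transport) and a local host; adjust both if your environment differs. The function names are illustrative, not part of the course material.

```python
import socket

# Default ports; change these if your installation uses different ones.
DEFAULT_PORTS = {"MongoDB": 27017, "Cassandra": 9042}

def is_listening(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False

def check_services(host="localhost", ports=DEFAULT_PORTS):
    """Map each service name to whether its port accepts connections."""
    return {name: is_listening(host, port) for name, port in ports.items()}

if __name__ == "__main__":
    for name, up in check_services().items():
        print(f"{name}: {'listening' if up else 'not reachable'}")
```

An open port only shows that something is listening there. It does not prove the database is fully initialized, so a failed import right after startup may just mean the server needs a few more seconds.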
Begin the Final Assignment Project: Working with NoSQL Databases.
- IBM Skills Network © IBM Corporation 2023. All rights reserved.