Skip to content

Latest commit

 

History

History
31 lines (22 loc) · 2.29 KB

README.md

File metadata and controls

31 lines (22 loc) · 2.29 KB

Gaming Industry Analysis of last 40 years

A Data Analysis of a Video-Game Industry Dataset


Requirements for the project

The project has been done in the Jupyter enviroment coded in Python (3.8). All the required packages and libraries have been mentioned in requirement.txt for your referal.

The data file would contain 2 files and one of them will have the 'improved' tag, this dataset consists a modified version of the orignal dataset wherein I have searched for the missing publisher column values.


Motivation

This project was a part of an evaluation assignment of Udacity's Data Scientist Nanodegree. The students were to select a dataset, come up with some questions and answer them through data analysis of the dataset, and post their findings through a blogpost. The dataset I chose is present on Kaggle, representing gaming industry information from 1984-2020 in reference to the video-game titles that have been released. The code answer's the following questions:

  • Which Genre has become most popular?
  • Which Platform has been the most popular to play games on?
  • How the Sales Trend for games has been for the past 40 years?
  • Publishers with most Global Sales along with which country offers the maximum of these sales?

Summary

The answers to the above questions and my findings can all be found in my blog post right here. Please do refer to it.


Licensing, Authors and Acknowledgements

The orignal data set is by Gregory Smith who had done a web scrape of the VG Charts. Recently Rush Kiruni took the same dataset and added more columns to it.


If you do find this repository useful, why not give a star and even let me know about it!

Feel free to express issues and feedback as well, cheers!