Sales_analysis_python

Data analysis of electronics product sales over a period of 12 months, done in a Jupyter notebook.

In this project I use pandas and Matplotlib to answer business questions about 12 months' worth of sales data. The data contains hundreds of thousands of electronics store purchases broken down by month, product type, cost, purchase address, etc.

I start by cleaning the data. Tasks in this section include:

- Dropping NaN values from the DataFrame
- Removing rows (duplicates) based on a condition
- Changing the type of columns (to_numeric, to_datetime, astype)

Once the data is cleaned up a bit, I move on to the data exploration section, where I explore five high-level business questions related to the data.
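
As a rough illustration of the cleaning steps listed above, here is a minimal sketch. The column names (`Order ID`, `Quantity Ordered`, `Price Each`, `Order Date`), the date format, and the sample rows are assumptions made for the example, and the "rows removed on a condition" are assumed to be header rows repeated inside the data.

```python
import pandas as pd

# Hypothetical raw rows; column names and values are assumptions for illustration.
raw = pd.DataFrame({
    "Order ID": ["1001", "1002", None, "Order ID", "1003"],
    "Product": ["USB-C Cable", "Monitor", None, "Product", "Headphones"],
    "Quantity Ordered": ["2", "1", None, "Quantity Ordered", "1"],
    "Price Each": ["11.95", "149.99", None, "Price Each", "99.99"],
    "Order Date": ["04/19/19 08:46", "04/07/19 22:30", None, "Order Date", "12/30/19 09:27"],
})


def clean_sales(df: pd.DataFrame) -> pd.DataFrame:
    """Return a cleaned copy of the raw sales rows."""
    # Drop rows in which every value is missing.
    df = df.dropna(how="all")
    # Remove rows matching a condition: here, header rows repeated in the data.
    df = df[df["Order ID"] != "Order ID"].copy()
    # Convert text columns to numeric and datetime types.
    df["Quantity Ordered"] = pd.to_numeric(df["Quantity Ordered"])
    df["Price Each"] = pd.to_numeric(df["Price Each"])
    df["Order Date"] = pd.to_datetime(df["Order Date"], format="%m/%d/%y %H:%M")
    return df.reset_index(drop=True)


cleaned = clean_sales(raw)
print(cleaned.dtypes)
```

Doing the type conversions after the bad rows are removed avoids conversion errors on values such as a stray header string.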

Q1. What was the best month for sales? How much was earned that month?
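
One way this question could be approached is to add a month column and a sales column, then group by month. The sketch below assumes columns named `Order Date`, `Quantity Ordered`, and `Price Each`; the rows are made up and do not reflect the real results.

```python
import pandas as pd

# Made-up sample orders; column names are assumptions.
orders = pd.DataFrame({
    "Order Date": pd.to_datetime(["2019-01-15", "2019-01-20", "2019-12-05", "2019-12-24"]),
    "Quantity Ordered": [1, 2, 1, 3],
    "Price Each": [100.0, 10.0, 600.0, 50.0],
})

# Add the columns needed for the aggregation.
orders["Month"] = orders["Order Date"].dt.month
orders["Sales"] = orders["Quantity Ordered"] * orders["Price Each"]

# Total sales per month, then the month with the largest total.
monthly_sales = orders.groupby("Month")["Sales"].sum()
best_month = monthly_sales.idxmax()
best_month_sales = monthly_sales.max()
print(best_month, best_month_sales)
```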

Q2. What city sold the most product?
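
A possible sketch for this question parses a city out of the purchase address with `.apply()` and then groups by it. The address layout ("street, city, state zip"), the column names, and the sample rows are assumptions; the helper `city_with_state` is hypothetical.

```python
import pandas as pd

# Made-up sample orders; the address format is an assumption.
orders = pd.DataFrame({
    "Purchase Address": [
        "917 1st St, Dallas, TX 75001",
        "682 Chestnut St, Boston, MA 02215",
        "333 8th St, Dallas, TX 75001",
    ],
    "Quantity Ordered": [2, 1, 3],
})


def city_with_state(address: str) -> str:
    """Return 'City (ST)' from an address shaped like 'street, city, ST zip'."""
    parts = [part.strip() for part in address.split(",")]
    state = parts[2].split(" ")[0]
    return f"{parts[1]} ({state})"


orders["City"] = orders["Purchase Address"].apply(city_with_state)

# Units sold per city, then the city with the largest total.
units_by_city = orders.groupby("City")["Quantity Ordered"].sum()
top_city = units_by_city.idxmax()
print(top_city)
```

Keeping the state next to the city name avoids merging different cities that share a name. Ranking by revenue instead of units would only require summing a sales column in the `groupby`.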

Q3. What time should we display advertisements to maximize the likelihood of customers buying a product?
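
One simple proxy for this question is the number of orders placed in each hour of the day. The sketch below assumes an `Order Date` datetime column and an `Order ID` column; the rows are made up.

```python
import pandas as pd

# Made-up sample orders; column names are assumptions.
orders = pd.DataFrame({
    "Order ID": [1, 2, 3, 4, 5],
    "Order Date": pd.to_datetime([
        "2019-04-19 11:05", "2019-04-19 19:46", "2019-04-20 19:10",
        "2019-04-21 12:30", "2019-04-22 19:59",
    ]),
})

# Extract the hour of day and count orders per hour.
orders["Hour"] = orders["Order Date"].dt.hour
orders_per_hour = orders.groupby("Hour")["Order ID"].count()
peak_hour = orders_per_hour.idxmax()
print(peak_hour)
```

Plotting `orders_per_hour` as a line graph makes peaks across the day easier to see than a single maximum.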

Q4. What products are most often sold together?
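
A sketch of one possible approach: treat rows that share an order ID as one basket and count every pair of products within each basket. The `Order ID` and `Product` column names and the sample rows are assumptions, and pair counting with `itertools.combinations` and `collections.Counter` is a technique chosen for this illustration.

```python
from collections import Counter
from itertools import combinations

import pandas as pd

# Made-up sample rows; one row per product in an order.
orders = pd.DataFrame({
    "Order ID": [1, 1, 2, 2, 2, 3],
    "Product": ["Phone", "Charging Cable", "Phone", "Charging Cable", "Headphones", "Monitor"],
})

pair_counts = Counter()
for _, products in orders.groupby("Order ID")["Product"]:
    # Sort so that (A, B) and (B, A) are counted as the same pair.
    unique_products = sorted(set(products))
    pair_counts.update(combinations(unique_products, 2))

most_common_pair, count = pair_counts.most_common(1)[0]
print(most_common_pair, count)
```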

Q5. What product sold the most? Why do you think it sold the most?
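
A minimal sketch for this question sums the quantity ordered per product and puts the mean price next to it, which gives one possible angle on the "why". Column names and sample rows are assumptions and do not reflect the real results.

```python
import pandas as pd

# Made-up sample orders; column names are assumptions.
orders = pd.DataFrame({
    "Product": ["AAA Batteries", "Monitor", "AAA Batteries", "Headphones"],
    "Quantity Ordered": [4, 1, 6, 2],
    "Price Each": [2.99, 149.99, 2.99, 99.99],
})

# Units sold and mean unit price per product, side by side.
summary = orders.groupby("Product").agg(
    units_sold=("Quantity Ordered", "sum"),
    mean_price=("Price Each", "mean"),
)
top_product = summary["units_sold"].idxmax()
print(summary.sort_values("units_sold", ascending=False))
```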

To answer these questions I used a range of pandas and Matplotlib methods, including:

- Concatenating multiple CSVs together to create a new DataFrame (pd.concat)
- Adding columns
- Parsing cells as strings to make new columns (.str)
- Using the .apply() method
- Using groupby to perform aggregate analysis
- Plotting bar charts and line graphs to visualize the results
- Labeling the graphs
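
To show how a few of these pieces fit together, the sketch below concatenates two CSV sources, aggregates them, and draws a labeled bar chart. The in-memory CSV strings stand in for monthly files, and their column names and values are made up for the example.

```python
import io

import matplotlib

matplotlib.use("Agg")  # render off-screen so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# In-memory stand-ins for monthly CSV files (hypothetical contents).
january_csv = io.StringIO("Month,Sales\n1,120.0\n1,80.0\n")
february_csv = io.StringIO("Month,Sales\n2,300.0\n")

# Read each source and stack the rows into a single DataFrame.
all_months = pd.concat(
    [pd.read_csv(january_csv), pd.read_csv(february_csv)],
    ignore_index=True,
)
monthly_sales = all_months.groupby("Month")["Sales"].sum()

# Bar chart with labeled axes and a title.
fig, ax = plt.subplots()
ax.bar(monthly_sales.index, monthly_sales.values)
ax.set_xticks(list(monthly_sales.index))
ax.set_xlabel("Month")
ax.set_ylabel("Sales")
ax.set_title("Sales by month")
```

With real files, the list passed to `pd.concat` would typically be built by calling `pd.read_csv` on each file path in the data folder.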