Comparing Gender Gap Across Degree Categories

image12s

In this project, we’ll explore how we can communicate the nuanced narrative of gender gap using effective data visualization. We’ll try to compare the gender gap of men and women in different categories.Randal Olson, a data scientist at University of Pennsylvania, has cleaned the data set and made it available on his personal website dataset.

Analysis

The data set contains the percentage of bachelor’s degrees granted to women from 1970 to 2012 in various fields.The data set is broken up into 17 categories of degrees, with each column as a separate category.

You can visit my Github repo for the complete code.

Let us have a glimpse of few rows and columns, the values represent percentage of women in various categories.

Year Agriculture Architecture Art and Performance Biology Business
1970 4.229798 11.92101 59.7 29.08836 9.064439
1971 5.452797 12.00311 59.9 29.3944 9.503187
1972 7.42071 13.21459 60.4 29.81022 10.55896
1973 9.653602 14.79161 60.2 31.14791 12.8046

Data  Visualisation

Let’s generate line charts for four-degree categories on a grid to encourage comparison between men and women.

The first two degree categories are dominated by men while the latter degree categories are much more balanced.

s1

By spending just a few seconds reading the chart, we can conclude that the gender gap in Computer Science and Engineering is quite large, on the other hand, there seems a significance rise of women in the fields like Agriculture and Architecture.

A more balanced gender gap 

Be it Business or Math, the gender gap is becoming less and getting more balanced.

studt_gender_gap

Go ahead and compare other categories, follow the Github link for the code Github.


Now, let’s look at a different perspective, let’s find what changes have come from 1970 till 2012?

download34

The change is evident, from a percentage share of nearly 5 % in Agriculture(which had the least share ) in 1970 to 50% in 2012.Women are certainly marching ahead then Men. You will see the similar trend in Business as well.

Let us see what changes have come in STEM fields over the year

Many scholars and policymakers have noted that women have historically been underrepresented in the fields of science, technology, engineering, and mathematics (STEM fields).

download33

Biology with its upward trend shows a significant improvement and a lot more presence of women, Engineering and Computer Science, on the other hand, is still dominated by men.

Conclusion

By spending just a few seconds reading the chart, we can easily compare the gender gap in various categories and analyze the trend over the year.That’s the power of Data Visualisation.

Thanks

Analysing Pixar Movies

pixar-movies

Pixar Animation is one of the most well-known animation studios in the world, with hits like Toy Story 3, Finding Nemo, Monster’s Inc, and A Bug’s Life, Pixar’s movies are adored by kids for their charming characters and by adults for their wit

In this project, I’ll explore the ups and downs of Pixar over the years primarily using data visualization. Data visualization is especially useful in this case since our sample size, only 15, is low and we can glean more general insights from exploring the data visually.

Analysis

You can visit my Github repo for the complete code and the dataset.

import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
%matplotlib inline
# reading the dataset
pixar=pd.read_csv("C:/Users/hp/Downloads/PixarMovies.csv")
pixar

The DataSet

Here are some of the columns in this dataset, PixarMovies.csv:

Year Released — the year the movie was released.
Movie — the name of the movie.
RT Score — the Rotten Tomatoes rating for the movie.
IMDB Score — the IMDB rating for the movie.
Metacritic Score — the Metacritic rating for the movie.
Opening Weekend — the amount of revenue the movie made on opening weekend (in millions of dollars).
Worldwide Gross — the total amount of revenue the movie has made to date.
Production Budget — the amount of money spent to produce the film (in millions of dollars).
Oscars Won — the number of Oscar awards the movie won.

Cleaning the data

#cleaning data removing % from domestic % columns and converting it to float
#making score out of 100 for imdb instead of 10
pixar['Domestic %']=pixar['Domestic %'].str.rstrip('%').astype('float')
pixar['International %']=pixar['International %'].str.rstrip('%').astype('float')
pixar['IMDB Score']=pixar['IMDB Score']*10

Adding new features to the dataset

# Adding new features
pixar['Profit']=pixar['Worldwide Gross']-pixar['Production Budget'] 
pixar['Domestic Profit']=pixar['Domestic Gross']-pixar['Production Budget']
pixar['International Profit']=pixar['Profit']-pixar['Domestic Profit']

Data Visualisation

How do the major review site rate Pixar movies?

# now analysisng critics score
critics_review=pixar[['RT Score','IMDB Score','Metacritic Score']]
critics_review.plot(figsize=(10,6),grid=False,linewidth=2)
plt.title("Critics Review")
plt.show()

pixar2

From the previous plot, it seems like the review site Rotten Tomatoes gives Pixar consistently higher ratings. Let’s generate a box plot to explore the question:

How are the average ratings from each review site across all the movies distributed?

#analysing through boxplot
critics_review.boxplot(figsize=(10,6),grid=False)
plt.title("Critics Review")
plt.show()

pixar3

From the above plot, it becomes clear that Rotten Tomatoes have consistently given higher scores.The most spread out scores are given by Metacritic ranging from right under 60 to right under 100.IMDB gives an average rating of 80.

The Profit Scenario

#The Profit Senario
pixar.sort_values('Profit')[['Profit','International Profit','Domestic Profit']]
.plot(kind='bar',figsize=(15,6),grid=False)
plt.title("The Profit Senario")
plt.show()

download4

Pixar studios have a terrific reputation when it comes to business, every single movie released has made profits, this is incredible.Toy Story 3 being the most successful movie in terms of business, followed by Finding Nemo. Movies which did not do well in the domestic market were still profitable due to its international presence worldwide. Cars 2 failed to impress in the Domestic market but did well overseas.

The International market

# how pixar took up the international market 
pixar_year=pixar.set_index("Year Released")
pixar_year[['Domestic %','International %']]
.plot(kind='bar',figsize=(15,6),grid=False)
plt.title("Comparing Domestic market vs International market year on year")
plt.show()

Does Pixar have a worldwide reach?

download7

Pixar always had a wider reach, from 2001 Pixar made more Profits from the overseas market than their domestic market. Their overseas popularity has always increased year on year. This says a lot about their worldwide popularity.

Let’s draw a stacked bar chart for better understanding:

# comparing domestic vs international collection
pixar[['Domestic %','International %']].plot(kind='bar',figsize=(15,6),stacked=True,grid=False)
plt.title("Domestic collection vs International collection in %")
plt.show()

download6

Spend sometimes visually exploring it. You’ll notice that there’s been a general decrease in the proportion of revenue that was made domestically. 

Opening Weekend Collection

#Comparing Production Budget vs Opening Weekend Collection
op=pixar[['Opening Weekend','Production Budget']]
op.sort_values('Opening Weekend').plot(kind='bar',figsize=(15,6),grid=False)
plt.title("Comparing Production Budget vs Opening Weekend Collection")
plt.show()

download5

The plot shows a very interesting result. Almost all movies collect more than 50 % of their cost in the opening weekend. 

The big stage: Oscars

#The big stage: Oscars
pixar[['Oscars Nominated','Oscars Won']]
.plot(kind='bar',figsize=(15,6),grid=False)
plt.title("The big stage: Oscars")
plt.show()

download8

9 out 16 movies produced by Pixar have one at least one oscar award. Isn’t that great feet.The movie WALL-E has received a maximum of 6 oscar nominations followed by Toy Story 3, Up, Ratatouille with each 5 nominations.

Toy Story 3, Up along with The Incredibles have won 2 Oscars.

Cars 2 and Monster University are the only movies which did not receive any oscar nominations.

Conclusion

The study brings into light the strong presence of Pixar in the international market. It’s growing popularity. The profit machine for its shareholders and receiving accolades at the Oscars.

Thank you

Design a site like this with WordPress.com
Get started